arXiv:l508.0526lvl [math.PR] 21 Aug 2015 


Regularity structures and the dynamical model 


August 24, 2015 


Martin Hairer 

The University of Warwick, Email: M. HairerOWarwick .ac.uk 


Abstract 

We give a concise overview of the theory of regularity structures as first exposed in IIHai 141 . In 
order to allow to focus on the conceptual aspects of the theory, many proofs are omitted and 
statements are simplified. In order to provide both motivation and focus, we concentrate on the 
study of solutions to the stochastic quantisation equations for the Euclidean quantum field 
theory which can be obtained with the help of this theory. In particular, we sketch the proofs of 
how one can show that this model arises quite naturally as an idealised limiting object for several 
classes of smooth models. 
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The purpose of these notes is to give a short informal introduction to the main concepts of the 
theory of regularity structures, focusing as an example on the construction and approximation 
of the dynamical 4>| model. They expand and complement the notes llHail51 which focus more 
on the general theory and the ideas of the proofs of the main abstract results. Here, we instead 
focus mainly on the construction of the dynamical <$3 model, as well as on the way in which the 
abstract theory can be used to obtain a number of rather non-obvious approximation results for 
this model. 

The theory of “regularity structures”, introduced in HHail41 . unifies various flavours of the the¬ 
ory of (controlled) rough paths (including Gubinelli’s theory of controlled rough paths l Gub04l . 
as well as his branched rough paths I Gub 1 Oil ), as well as the usual Taylor expansions. While it 
has its conceptual roots in Lyons’s theory of rough paths |Lyo98|, its main advantage is that it is 
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no longer tied to the one-dimensionality of the time parameter, which makes it also suitable for 
the description of solutions to stochastic partial differential equations, rather than just stochastic 
ordinary differential equations. This broader scope requires a theory that admits more flexibility 
than the theory of rough paths, so we will see that the underlying algebraic structure (which in 
the case of the theory of rough paths is always given by the tensor algebra endowed with the 
concatenation and shuffle products) is problem-dependent and enters as a parameter of the theory. 
While the exposition of these notes aims to be mostly self-contained, none of the proofs will be 
given in detail, instead we will only sketch the main arguments. 

The main achievement of the theory of regularity structures is that it allows to give a mean¬ 
ing, as well as a robust approximation theory, to ill-posed stochastic PDEs that arise naturally 
when trying to describe the crossover regime between two universality classes for various sys¬ 
tem of statistical mechanics. One example of such an equation is the KPZ equation arising as a 
natural model for one-dimensional interface motion in the crossover regime between the Edwards- 
Wilkinson and the KPZ universality classes HKPZ861IBG971 |Quall[ IHail 311 : 


d t h = d 2 x h + (d x h) 2 + £ — C . 


(KPZ) 


Another example is the dynamical <I> : ] model arising for example in the stochastic quantisation of 
Euclidean quantum field theory IIPW811IJLM851fAR911IDPD031IHail4l . However, it also arises 
as a description of the crossover regime for the dynamic of phase coexistence models near their 
critical point between the mean-field theory and the “Wilson-Fisher” renormalisation fixed point 
I1GLP991 [MW 141 . This model can be written formally as 


<9t<b = Afi> + C<b - $ 3 + £ . 


( q> 4 ) 


In both of these examples, £ denotes space-time white noise, namely the generalised Gaussian 
random field such that 

E£(t, x)£(s, y) = 6(t - s)5(y - x) . (1.1) 

Furthermore, C is a constant (which will actually turn out to be infinite in some sense!), and 
we consider these equations on bounded tori. In the case of the dynamical <f>| model, the spatial 
variable has dimension 3, while it has dimension 1 in the case of the KPZ equation. Why are these 
equations problematic? As one can guess from (II.II) . typical realisations of the noise £ are not 
functions, but rather irregular space-time distributions. As a matter of fact, it follows immediately 
from (11.11 ) that the law of £ is invariant under the substitution £(i, x) t-x A 2 +1 £(A 2 £, Ax), which 
correctly suggests that its samples typically belong to the parabolic C a spaces only for ct < — 1 — 
(We write C a as a shorthand for the parabolic Besov space ^ loc , which coincides with the 
usual parabolic C a spaces for positive non-integer values of a. It is in some sense the largest 
space of distributions that is invariant under the scaling </?(■) i->- A _C V(A _1 -), see for example 
IBP081 .) 

As a consequence, even the solution u to the simplest parabolic stochastic PDE, the stochastic 
heat equation dtu = Am + £, is quite irregular. As a consequence of the above, combined with 
classical parabolic Schauder estimates, one can show that if the spatial variable has dimension d, 
then the solution u belongs to C a (again in the parabolic sense) for every a < but not for 
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= 4-“. In particular, one expects solutions h to (1KPZI) to be of class “just about not” Holder- 


1 

2 ' 


This begs the following question: if typical solutions h are nowhere differentiable, what does the 
nonlinearity (d x h) 2 mean? Similarly, we see that one needs to take a < 0 in dimensions 2 or 
higher, which means that in these dimensions the solutions to the stochastic heat equation are no 
longer functions themselves, but only make sense as space-time distributions. As a consequence, 
it is not clear at all what the meaning of the nonlinearity ( I> ,! is in the dynamical 4>j model. 

This is the type of question that will be addressed in this article. It is hopeless to try to build 
a consistent theory allowing to multiply any two Schwartz distributions, as already pointed out 
by Schwartz himself I ISch54H . It is possible to extend the space of Schwartz distributions to some 
larger class of objects that do form an algebra (and such that the product of a Schwartz distribution 
with a smooth function has its usual meaning), and this has been explored by Colombeau I Col831 . 
However, while this frameworks does allow one to treat stochastic PDEs 1 AHR011 . it is not clear 
at all what the meaning of the resulting solutions are. In fact, there are very strong hints that 
solutions built using such a theory are not “physically relevant”: in the case of the sine-Gordon 
model they fail to “see” the Kosterlitz-Thouless transition [FS8I, lFa!121 . while the theory of 
regularity structures does break down precisely at that transition I1HS141 . as expected for a theory 
designed to treat “subcritical” models. 

Instead, the direction pursued here is to exploit as much as possible a priori information on the 
model at hand. Instead of building a general theory allowing to multiply any two distributions, we 
are only interested in multiplying those distributions that could potentially arise in the right hand 
side of the equation under consideration. The theory of regularity structures provides the tools 
and techniques to design such “purpose-built” spaces in a systematic way, as well as the objects 
required to encode the various renormalisation procedures arising in these problems. While a 
full exposition of the theory is well beyond the scope of this short introduction, we aim to give 
a concise overview to most of its concepts. In most cases, we will only state results in a rather 
informal way and give some ideas as to how the proofs work, focusing on conceptual rather than 
technical issues. For precise statements and complete proofs of most of the results exposed here, 
we refer to the articles [ Hail4ilHS15l HQ15(lHX15l . The type of well-posedness results that can 
be proven using the techniques surveyed in this article include the following. 


Theorem 1.1 Let = g £ * 8 denote the convolution of space-time white noise with a compactly 
supported smooth modifier o that is scaled by e in the spatial direction(s) and by e 1 in the time 
direction. Denote by 'Iu the solution to 


d t <s> £ = A$ e + C7 e $ e - , (1.2) 

on the three-dimensional torus. Then, there exist choices of constants C e diverging as e —>• 0, as 
well as a process <I> such that <lu —>■ d> in probability. Furthermore, ifC £ is suitable chosen, then 
<I> does not depend on the modifier o. 

Remark 1.2 Very similar results have recently been obtained by slightly different techniques. In 
ICC 131 . the authors use the theory of paracontrolled distributions introduced in HGIP121 . This 
theory is a kind of Fourier space analogue to the theory of regularity structures, but seems for 
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the moment restricted to first-order expansions. In [Kupl5] on the other hand, the author uses a 
variant of Wilson’s renormalisation group ideas. 


Remark 1.3 We made an abuse of notation, since the space-time white noise appearing in the 
equation for h E is on R x T\ while the one appearing in the equation for <Iu is on R x T 3 . 
Similarly, the modifier g £ is of course different for the two equations. 


Remark 1.4 In both cases, convergence is in probability in C a for 0 < a < -7 in the case of the 
KPZ equation and — | < a < — ^ in the case of the <1 > 4 model. This requires the corresponding 
initial conditions to also belong to the relevant C a spaces. 

It is also possible to show that various natural approximation schemes converge to the dynami¬ 
cal <I> : i model. Take for example a function 6 H > Vg taking values in the space of even polynomials 
on R of some fixed degree 2m and consider the equation 

dt$ £ ,e = Adu e — e _3//2 V r 0 , (v / c^ > e,0) + £ e , (1-3) 


with £ e as above. This particular scaling arises naturally when rescaling a weakly nonlinear 
model, see [1HX151 for more details. Denote by p = Af(0, C) the centred Gaussian measure with 
covariance 

C = || q * T > || 2 2 j (1.4) 


where P denotes the 3-dimensional heat kernel and g is the modifier appearing in the definition of 
£ e . Note that since d = 3, this quantity is indeed finite. (In dimensions d < 2 the heat kernel fails 
to be square integrable at large scales.) We then define the “effective potential” (Vg) = // * Vo and 
assume that 9 i—)• (Vg) exhibits a pitchfork bifurcation at the origin at 9 = 0. We also normalise 
the solution in such a way that (V'o) (4) (Oj = 6 , which guarantees that (Vq)\u) = u 3 + 0{u 5 ) for 
u <C 1. The main result of lfHX15l . which builds on the analogous results obtained in | HQ15| for 
the KPZ equation then reads 


Theorem 1.5 In the above setting, there exist values a > 0 and h G R, such that if one sets 
9 = as log e + be, then the solution to (11.31) converges as e —>• 0 to the process <J> built in 
Theorem 17.71 

Remark 1.6 Concerning the initial condition, one needs to consider a sequence Tjy' of smooth 
initial conditions converging to 4>o € C a with — 5 — 4777 < a < — ^ in a suitable sense. This 
is because if we chose a fixed initial condition in C -1 / 2 say, (11.31) may fail to even possess local 
solutions for fixed e > 0 . 

We can also consider approximations of the type given in Theorem 11.11 but with a non- 
Gaussian approximation to white noise. In this case, one typically has to add additional countert¬ 
erms in order to obtain the same limit. Consider for example an approximation of the type 

€e(t,x) = e~ 5 / 2 p{t/e 2 ,x/e) , 


( 1 - 5 ) 
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for a stationary process ijonRx R which admits moments of all orders and has finite dependence 
in the sense that a k and a f -- are independent whenever 


inf \z — z\ > 1 . 

zeK 

zeK 

Here, we wrote a k for the cr-algebra generated by all evaluation maps 77 ( 2 ) for z € ivQit is then 
possible to show that there exist suitable choices of constants C® such that solutions to 

dt^e = + Cf ] + C ®~^ £ +^, (1.6) 


converge as e 0 to the process <b built in Theorem 11.11 Details of the proof can be found in 
IIHS1511 for the case of the KPZ equation. It is shown in ILHS151 , Remark 6 . 6 ] that these constants 
are of the type 


CP 


c a,i) £(i,2) 

737^ + 717^’ 


< 7 ( 2 , 1 ) 

Cf = --+ clogs + (7 (2,2) , 

£ 


(1.7) 


where the depend on the details of the process 77 , but c is universal. We will provide a short 
sketch of the argument showing how these constants appear in Section 15711 below. 

Another very nice way in which the dynamical <3 ? 4 model appears is as the crossover regime 
for the Glauber dynamic in an Ising-Kac model IIMW14I1 . Here, one considers an Ising model, 
but with the usual nearest-neighbour Hamiltonian replaced by a Hamiltonian of the type 


H = K i(\ x ~ y\) a xv x , 

x,y 

where K- f (r) = ^y~ d K('yr) for some small value 7, and K is a positive, smooth, compactly 
supported function. On is then interested in the simultaneous limit 7 —>■ 0 and N ^ oc (where 
N is the side length of a discrete torus on which a lives) at (or sufficiently near) the critical 
temperature. In dimension d = 2, it was shown in (MW 141 that by suitably choosing N and the 
inverse temperature as a function of 7 and considering the Glauber dynamic on suitable long time 
scales, one recovers the dynamical T.j model in the limit. A similar result is of course conjectured 
to hold for d = 3. We will not consider any discrete approximation of this type in these notes 
but refer instead to 1CW151 for a recent review of the techniques involved in the proof of such a 
result. 

One of the main insights used in the proof of the type of results mentioned in this introduction 
is that while solutions to equations like (1KPZI) or dd* ! l ) may appear to be very rough (indeed, they 
are not even functions in the case of d ( I> 'll ), they can actually be considered to be smooth, provided 
that one looks at them in the right way. To understand what we mean by this, it is worth revisiting 
the very notion of “regularity”. The way we usually measure regularity is by asking how well 
a given function can be approximated by polynomials. More precisely, we say that a function 
F : R d -> R is of class C 1 with 7 > 0 if, for every point x € R d we can find a polynomial P x of 

1 Since we always consider equations on bounded tori, we actually need to consider a suitably periodised version 
of £ e , see IHS151 Assumption 2.1] for more details. 
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degree [ 7 ] such that \F(y) — P x (y)\ |y — x| 7 for y close to x. The idea now is to consider spaces 
of “regular” functions / distributions, but where “regularity” is measured by local proximity not 
to polynomials, but to linear combinations of some other “basis functions” that are specific to the 
problem at hand. In particular, the objects that play the role of polynomials are allowed to be 
random themselves, and they are also allowed to be distributions rather than just functions. One 
might now hope (and this is indeed the case) that if we are given a consistent product rule on these 
basic objects and if we have two distributions that are sufficiently “smooth” in the sense that they 
are locally described by linear combinations of these objects (a “local specification”), then there 
exists a unique distribution, which we interpret as the product, whose local specification is given 
by the products of the local specifications of each factor. This then allows to formulate equations 
like (1KPZI ) or (IT 1 ! ) as fixed point problems in some of these spaces of “smooth” functions / 
distributions, and to build a local solution theory with many nice properties very similar to those 
one would have for the corresponding deterministic problems with smooth inputs. 

The remainder of these notes is organised as follows. First, in Section 0 we provide a fresh 
look at the definitions of the classical spaces C 7 and we describe a natural generalisation, as 
well as a situation where such a generalisation is useful. In Section [3} we then give the general 
definition of a regularity structure and of the associated analogues to the spaces C 7 . We also 
formulate the reconstruction theorem which is fundamental to the theory and we give a simple 
application. We then provide some operational tools in Sections [3721 and [331 Finally, we show in 
Sections [4] and 14.31 how to apply the theory to stochastic PDEs in general and the dynamical 
model in particular. 
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2 Another look at smooth functions 


Before we turn to a more precise description of how this idea of replacing Taylor polynomials 
by purpose-built functions / distributions is implemented in practice, let us first have a slightly 
different look at the definition of the classical Holder spaces. For the sake of this discussion, let 
us fix some exponent 7 € (1, 2). The space C 7 (5 1 ) then consists of those functions / that are 
continuously differentiable and such that their derivative /' satisfies \f'(t) — /'(s)| < \t — s| 7_1 . 
A natural seminorm on C 7 is then given by 


II/II7 = SU P 

S^t 


1 m - f( S )\ 

\t - si 7 " 1 


(2.1) 


where f denotes the derivative of /. This definition is more complicated than it may appear 
at first sight since it is given in terms of /', rather than the function / itself, and f has to be 
computed first. Instead, we would like to consider an element of C 7 as a pair of functions (/, /'), 
endowed with the norm 


IK/, /')ll 7 = sup - ff 1 V sup - /W - /,(S)( * - s)l 


S^t 


If - si 7 " 1 


S^t 


|f — s| 


( 2 . 2 ) 
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where a V b denotes the maximum of a and b. It is not difficult to verify that if f is the derivative 
of /, then as a consequence of the identity 

fit ) - /(s) - f'{s){t s') — f (/'(r) - /'(s)) dr , 

J S 

the seminorm (12.21) is indeed equivalent to (12.11 ). Furthermore, the finiteness of the second part of 
the expression ( 12 . 21 ) forces f to be differentiable with derivative /', since otherwise that supremum 
would be infinite. In this point of view, we therefore do not have to impose a priori that f is the 
derivative of /, this simply turns out to be a side-effect of having ||(/, /')|| 7 < oo. Furthermore, 
viewing the pair (/, /') as our primary data has the advantage that we do not need to compute any 
additional data (the derivative) from it in order to express its norm. At this stage, one may worry 
that the notation quickly becomes cumbersome when considering higher degrees of regularity. 
Indeed, for 7 E (2, 3) we could come up with a similar norm, but this time we would have to 
consider triples (/, /', /"), etc. Instead, we prefer to rewrite an element of C 7 as a single function, 
but taking values in R © R « R 2 , with the first component equal to / and the second component 
equal to f. In order to be able to easily distinguish these components, we denote by 1 the basis 
vector corresponding to the first component and by X the basis vector for the second component, 
so that our pair (/, f) can be written as 

Fit) = f{t)\ + f\t)X . 

At this stage, this is nothing but a change of notation. However, this notation already suggests a 
very natural product rule for elements in C 7 : we postulate that 11 = 1, 1 ■ X = X ■ 1 = X, and 
X ■ X = 0, and we define (F ■ G)(t) = F(t) ■ G(t). With this definition, one has 

(F ■ G){t) = f(t)g(t)l + if'it)g{t) + f(t)g'(t))X , 

i.e. the component multiplying X is “automatically” given by what the Leibniz rule suggests 
should be its correct expression. 

Each of the basis vectors r E {1, A} also comes with a natural “degree” (or “homogeneity”) 
|r| given by |1| =0 and X| = 1. There is an analytical meaning to this homogeneity: the 
vector 1 also quite naturally represents the constant function 1 and the vector X represents the 
monomial of degree 1 in a Taylor expansion around some base point. In this way, we can view 
F as a function taking values in the space of Taylor polynomials of degree 1. This suggests 
the introduction of a family of linear maps {n s } se5 i which associate to each basis vector the 
corresponding Taylor monomial: 

(n s l)(t) = 1 , (n s X){t) = t- s . (2.3) 

With this notation, given F E CF, the function t i-a (n s i ? (.s))(f) is nothing but its first-order 
Taylor expansion at s. Furthermore, the degree of a vector r E {1 , AL } is precisely the order at 
which the map t fll s r)(t) vanishes near t = s. The maps 1 f, yield an easy way to recover the 
“actual” real-valued function described by the vector-valued function F: setting 

(KF)(t) = ( n t F{t))(t ) , 


(2.4) 
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we see that in this case 7 ZF is nothing but the component of F multiplying 1, which is indeed the 
real-valued function / represented by F. One crucial property of the maps II s is that varying s 
can also be achieved by composing them with an adequate linear transformation. Indeed, setting 

r s fl = 1 , T st X = X + (s — t)l , (2.5) 


it is straightforward to verify that one has the identities 

n t = u s r st , = r su . 


(2.6) 


With this notation, and making a slight abuse of notation by also using {1,26} for the dual 
basis, the norm (12.21) can be rewritten in a more concise and much more natural way as 


F|| 7 = sup 
re{l,X} 


I <T, F(t) - T ts F(s))\ 
|f-s|T'-l r l 


(2.7) 


It is now immediate to generalise these definitions to the case of 7 € R+ \ N by introducing 
additional basis vectors X k , postulating that X° = 1 and X k ■ X f = X k+e , and extending the 
definition of the maps T s ./ to these additional vectors by imposing that it satisfies the multiplicative 
property 

r.st(rf) = (T st T)(T st f) . 

One can then verify that (12.61 ) still holds provided that one sets (II s X fc )(f) = (t — s) k . This is a 
consequence of the fact that first changing the base point s and then multiplying two monomials 
is the same as first multiplying them and then changing the base point. 

Note now that the definition of the seminorm (12.71 ) is extremely robust and does not refer 
anymore to any of the details that make a Taylor polynomial a polynomial. For example, for 
a G (0,1) we could fix an a-Holder continuous (in the usual sense) function W and, instead of 
dH, we could set 

(I I s i)(t) = 1, (n s x){t) = W{t) - W{s). 

Since (II s 26)(f) now only vanishes at order a near t = s, this strongly suggests that we should 
set | X | = a. The identity (12.61) is still satisfied if we change the definition of T s t into 


T st l = 1 , T st X = X + (FF(s) - W(t)) 1 . (2.8) 

With these definitions, the norm (12.71 ) is still very natural, but its meaning now is that, setting 
F{t) = f(t)l + f'{t)X as before, the function f is Holder continuous of order 7 — a and the 
function / is such that 


m = f(s ) + f\s){W(t) - W{8)) + Oi\t - sP) . (2.9) 

In particular, if 7 > a, it no longer implies means that / is of class C 7 , only that its increments 
“look like” some multiple of those of W, up to a remainder of order 7. Furthermore, f is now no 
longer equal to the derivative of /. 
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Remark 2.1 Consider the interesting case 0 < a < 7 < 1 for the situation just described. Then 
it is no longer necessarily true that the “derivative” f is uniquely determined by / if all we know 
is that 11 F |j ry < 00. If for example we take Wit) = cos(f) which is certainly a-Hdldcr continuous, 
although it is of course much more than that, then the identity (12.91 ) simply forces / to be 7 - 
Holder continuous and puts no restriction whatsoever on /'. If, on the other hand, W happens to 
be “nowhere 7 -Holder” in the sense that for every s € S' 1 one can find a sequence t n -A s such 
that | W{t n ) — W{s)\/\t n — s | 7 —>• 00, then (12.91) determines f uniquely. A quantitative version 
of this statement, together with an application, can be found in HHP131 . 


The functions of class “C 7 with respect to some function W" just described in (12.91 ) play a 
prominent role in the theory of controlled rough paths | Lyo98( [Gub041 ILCL071 . so let us see 
how these definitions are useful there. The setting is the following: we want to provide a robust 
solution theory for a controlled differential equation of the type 


dY = / (Y) dW(t) , 


( 2 . 10 ) 


where W £ C a is a rather rough function (say a typical sample path for an m-dimensional Brow¬ 
nian motion). In general, we allow for W to take values in R m and Y in R" for arbitrary integers 
n and m, so we cannot solve (12.10b by simply setting Y(t) = Z(Wit)) with Y the solution to 
the ODE Z = f(Z). It is a classical result by Young I You36ll that the Riemann-Stieltjes integral 
(Y. W) i -> f 0 Y dW makes sense as a continuous map from C a x C a into C a if and only if a > 

As a consequence, “naive” approaches to a pathwise solution to (12.10b with f () f(Y ) dW inter¬ 
preted in Young’s sense are bound to fail if W has the regularity of Brownian motion, since the 
modulus of continuity ui(h) of the latter behaves no better than uj(h) = \/‘2h log 1 /h. A fortiori, 
it will fail for example if W is a typical sample path of fractional Brownian motion with Hurst 
parameter H <\. 

In order to break through this barrier, the main idea is to exploit the a priori “guess” that 
solutions to (12.10b should “look like W at small scales”, i.e. in order to tty to define an integral 
f 0 Z dW for any two elements Z and W of some function space, we only consider those Z' s 
that could plausibly arise as Z = f{Y) for Y the solution to (12.10b . More precisely, one would 
naturally expect the solution Y to satisfy 

Y t = Y s + Y'W s , t + 0{\t - s\ 2a ) , ( 2 . 11 ) 


for some function Y', where we wrote W S) t as a shorthand for the increment iT) — W s . As a matter 
of fact, one would expect to have such an expansion with the specific choice Y’ = f(Y). As a 
consequence, one would also expect a similar bound to (12.1 lb for Z = f(Y) with Z’ = f'(Y). In 
other words, one would make the a priori guess that solutions to (12. 10 b satisfy |Y 1 + Y' X\\ 2 a < 
00, provided that we define || • \\ 2 a as in (12.7b with \X \ = a and T st as in (12.8b (with the caveat that 
in the multidimensional case one should now introduce symbols Xi for i £ { 1 ,... , m}, interpret 
(12.8b as a “vector” identity, take Y' matrix-valued, etc). 

It now remains to provide a coherent construction of the integral f Z dW for such functions 
(or rather pairs of functions ( Z, Z' )). The main idea is to then simply postulate the values of the 
integrals 


W 


S,t —• 


W S)T (g) dW r , 


■t 


( 2 . 12 ) 
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instead of trying to compute them from W (which is doomed to failure). For a two-parameter 
function W to be a “reasonable” candidate for the right hand side of (12.12 1) . one would like it to 
satisfy Chen’s relations 

W Sjt - W SjU - Wu,t = W s ,u ® W Ut t , (2.13) 

since these follow from the requirements that J* cdW r = cW s j for any c,s,t E R and that the 
sum of two integrals with the same integrand over adjacent intervals equals the integral over the 
union of these intervals. By simple scaling, it is also natural to impose the analytic bound 

|W s ,t| < |i-s| 2 “ . (2.14) 


One can then exploit this additional data to give a coherent definition of expressions of the type 
f Z dW, provided that the path W is “enhanced” with its iterated integrals W and Z is a “con¬ 
trolled path” of the type (12.1 II) . Indeed, it suffices to set 


Z„ dW, 


def .. 

= lim 

|f|-h) 


Y, (Z S W S , U + Z' S W s , u ), 
[s,u]eT 


where V denotes a partition of [0, t] (interpreted as a collection of closed intervals) and \V\ de¬ 
notes the length of the longest interval in V. It is a fact that the analytical bounds given above, 
toaether with (12.131). euarantee that this limit exists as soon as a>\ and that, if we set 

Y t = f Z s dW s l + Z t X, 

Jo 

then one has again 11 Yj | 2 a < oo, thus allowing to formulate (12.101) as a well-behaved fixed point 
problem. See for example tGub04ll or the lecture notes IIFH141 for a more detailed exposition. 


3 The basic theory of regularity structures 

Let us now set up in more detail a general framework in which one can define “Holder-type” 
spaces as above. Our first ingredient is a vector space T that contains the coefficients of our 
“Taylor-like” expansion at each point. In the previous example, this space was given by the linear 
span of 1 and X. In general, it is natural to postulate that T is an arbitrary graded vector space 
1 = ©aGA for some set A of possible “homogeneities”. For example, in the case of the usual 
Taylor expansion, it is natural to take for A the set of natural numbers and to have T contain the 
coefficients corresponding to the derivatives of order l. In the case of controlled rough paths 
however, we have already seen that it is natural to take A = {0,«}, to have again To contain the 
value of the function Y at any time s, and to have T a contain its “Gubinelli derivative” Y'. This 
reflects the fact that in that case the “monomial” t i-a (FI. s X)(f) = W s ,t only vanishes at order a 
near t = s, while the usual monomials t (t — sf vanish at integer order £. In general, we only 
assume that each T a is a real Banach space, although in many examples of interest these spaces 
will be finite-dimensional. 
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This however isn’t the full algebraic structure describing Taylor-like expansions. Indeed, we 
have already seen that a crucial characteristic of Taylor expansions is that an expansion around 
some point xq can be re-expanded around any other point x \, namely simply by writing 

(x — xo) m = Cr)( X i~ x °) k ' (x - Xl ) e ■ (3-1) 

k+£=m ' 

(In the case when x € R rf , k, i and m denote multi-indices and kl = k\l... A: r /!.) In general, 
we have seen in both of our examples that there are linear maps T st transforming the coefficients 
of an expansion around t into the coefficients of the same “polynomial”, but this time expanded 
around s. 

What is a natural abstraction of this fact? In view of the above examples, it is natural to impose 
that any such “reexpansion map” T s t has the property that if t € T a , then T st T—r £ ®/3<a T /3 
T <a . In other words, when reexpanding a homogeneous monomial around a different point, the 
leading order coefficient remains the same, but lower order monomials may appear. Furthermore, 
one should be able to compose reexpansions, since taking an expansion around t, reexpanding it 
around s and then reexpanding the result around a third point r should be the same as reexpanding 
the first expansion around r. In other words, it seems natural that one has the identity T rs r st = 
T rt , which is indeed the case for the examples we have seen so far. These considerations can 
be summarised in the following definition of an algebraic structure which we call a regularity 
structure: 

Definition 3.1 Let A C R be bounded from below and without accumulation point, and let T = 
©«gA Ta be a vector space graded by A such that each T a is a Banach space. Let furthermore 
G be a group of continuous operators on T such that, for every a £ A, every T € G, and every 
t € T a , one has Tr-r £ T <a . The triple 2? = ( A , T, G) is called a regularity structure with 
model space T and structure group G. 

Remark 3.2 In principle, the set A can be infinite. By analogy with the polynomials, it is then 
natural to consider T as the set of all formal series of the form XXgA r «’ w here only finitely 
many of the r Q ’s are non-zero, endowed with the topology of term-wise convergence. This also 
dovetails nicely with the particular form of elements in G. In practice however we will only ever 
work with finite subsets of A so that the precise topology on T does not matter. 

A regularity structure as given by Definition 13. ll is just a kind of algebraic “skeleton”: it does 
not require any underlying configuration space and it contains no information as to which actual 
functions / distributions its elements are supposed to describe. It only becomes useful for our 
putpose when endowed with a model, which is the analytical “flesh” associating to any r € T 
and xq £ R' ; , the actual “Taylor polynomial based at xq” represented by r. In order to link 
the algebraic description to the corresponding analytical objects, we want elements t £ T a to 
represent functions (or possibly distributions!) that “vanish at order a” around the given point xq. 

Since we would like to allow elements in T to represent distributions and not just functions, 
we cannot evaluate them at points and thus need a suitable notion of “vanishing at order a”. We 
achieve this by controlling the size of our distributions when tested against test functions that are 
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localised in a small region around the given point xq. Given a test function ip on R d , we write tp* 
as a shorthand for 

<Px(v) = A _< V(A _1 (y - x)) . 

Given r > 0, we also denote by B r the set of all smooth functions p-. R d -> R such that ||</?||c 7 ' < 
1, and that are furthermore supported in the unit ball around the origin. With this notation, and 
writing furthermore S'(R d ) for the space of distributions (not necessarily tempered) on R d , our 
definition of a model for a given regularity structure -7 is as follows. 

Definition 3.3 Given a regularity structure .7 and an integer d> 1, a model for .7 on R d consists 
of maps 


n: R d -A C(T, S'(R d )) T:R d x R d -> G 

x^Ii x ( x,y)^T xy 

such that r xy r yz = r xz and B :r T xy = \\y. Furthermore, given r > | inf A\, for any compact set 
ii c R and constant 7 > 0 , we assume that there exists a constant C such that the bounds 

l(n x r)(^)|<C , Al T l||r|| a , \\r xy T\\p<C\x-y\ a -P\\T\\ a , (3.2) 

hold uniformly over ip € B r , (x , y) € .ri, A € (0,1], r € T a with a < 7 , and (3 < a. We denote 
by .M the space of all models for a given regularity structure. 

Remark 3.4 Given t € T, we wrote ||r|| a for the norm of its component in T a . In other words, 
if r = ® a r a with r a € T a , then ||r|| Q = ||T a ||r Q - 

Remark 3.5 The identity II x T xy = II, ; reflects the fact that is the linear map that takes an 
expansion around y and turns it into an expansion around x. The first bound in (13.21) states what 
we mean precisely when we say that r e T a represents a term of order a. The second bound in 
(13.21) is very natural in view of both (12.51) and (12.81 ). It states that when expanding a monomial of 
order a around a new point at distance h from the old one, the coefficient appearing in front of 
lower-order monomials of order (3 is of order at most h a ~^. 

Remark 3.6 In many cases of interest, it is natural to scale the different directions of R cZ in a 
different way. This is the case for example when using the theory of regularity structures to build 
solution theories for parabolic stochastic PDEs, in which case the time direction “counts double”. 
To deal with such a situation, one can introduce a scaling s of R rf , which is just a collection of d 
mutually prime strictly positive integers and one defines ip* in such a way that the ith direction 
is scaled by A 0i . In this case, the Euclidean distance between two points should be replaced 
everywhere by the corresponding scaled distance Ms = Xa \xi\ 1 ^ Si . See also !Hail41 for more 
details. 

With these definitions at hand, it is then natural to define an equivalent in this context of the 
space of 7 -Holder continuous functions in the following way, which is the natural generalisation 
of© 
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Definition 3.7 Given a regularity structure 22 equipped with a model (II, T) over R, the space 
D 7 = F^i22, p) j s given by the set of functions /: —>• ® c T a such that, for every compact 

set it and every a < 7, the exists a constant C with 

II m - T xy f(y)\\ a < C\x - y y- a (3.3) 

uniformly over 

Remark 3.8 Note that, given 22 and a model (II, T) € .22., the corresponding space D 7 is a 
Frechet space. However, as we have already seen in Remark l2Tl above. this space does in general 
depend crucially on the choice of model. We thus have a “total space” .22 x T) 1 containing all 
triples of the form (n, F. F) with F € V 7 based on the model (n, F). This space is no longer 
a linear space, but it still comes with a natural topology: the distance between (n, T, /) and 
(FI, f, /) is given by the smallest constant g such that 

\\f(x) - f{x) - T xy f(y ) + T xy f(y)\\ a < g\x - t/| 7- “ , 

\iU x T-U x r)(^)\ < pA"||r|| , 

||r xy r - r xyT y < g\x - y \ a ~ 1 [|r||, 

uniformly for x. y in some compact set. 

At this point, we should pause and ask the following question: given / € V 7 for a given regu¬ 
larity structure and model, what is the actual function / distribution represented by /? Recall that 
we have seen previously in (12.41) that it should represent IZf given by (7 Zf)(x) = (II r /(.x))(.x). 
However, this definition now no longer makes sense since H x f(x) is a distribution in general and 
can therefore not be evaluated at x\ The most fundamental result in the theory of regularity struc¬ 
tures then states that, given / € 27 7 with 7 > 0 , there exists a unique distribution TZf on R ; such 
that, for every x € R d , Kf ‘ ‘looks like H x f(x) near x”. More precisely, one has 

Theorem 3.9 Let 22 be a regularity structure as above and let (n, T) € -/22 be a model for 22 on 
R d . Then, there exists a unique linear map 12 : 27 7 —¥ 6' , (R' / ) such that 

\{12f-U x fix))i^)\<X\ (3.4) 

uniformly over ip € B r and A as before, and locally uniformly in x. 

We do not provide a proof of this result in these notes, but a concise version can be found 
in HHail51 . It relies crucially on the fact that our assumptions guarantee that the distributions 
( x = n a ./(x) vary slowly with x: 

iCx - (y)(p x ) = n X (fix) - Y X yf{y))iip x x ) < Y, A Q Is - vV~ a < A 7 , (3.5) 

a<7 

for | y — z | < A. In particular, the value of Q v (p x ) varies by at most of the order A 7 as y varies 
over the support of ip x . In fact, one can show more generally that given a function x i-a ( x which 
is (locally) uniformly bounded in C a for some a < 0 and such that (13.51) holds for some 7 > 0, 
then there exists a unique distribution ( € C a such that (( — f r )(p x )\ < A 7 , locally uniformly in 
x, see IHail41 Prop. 3.25]. 
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Remark 3.10 The spaces V 7 are natural generalisations of Holder spaces. One may wonder 
whether there are also natural generalisations of other classical function spaces and whether The¬ 
orem [T9] still applies. In ||HL151 . the authors show that approximations to the multiplicative 
stochastic heat equation converge even when the initial condition is taken to be a Dirac mass. For 
this, it appears that weighted Holder-type spaces are not suitable spaces to work with. Instead, 
the authors work with a generalisation of a scale of inhomogeneous Besov spaces. It seems likely 
that most classical function spaces admit generalisations to the present context. The rule of thumb 
regarding the reconstruction theorem is that one typically has existence and uniqueness of the re¬ 
construction operator as soon as the “regularity index” of the corresponding classical function 
space is positive. 

Remark 3.11 Since the spaces X> 7 depend on the choice of model (n, T), the reconstruction 
operator R. itself also depends on that choice. Since the aim of the theory is to provide robust 
approximation procedures, we should really view TZ as a map from the total space .X/, v. D 7 into 
S\R d ). It turns out that even if viewed it in this way, TZ is still a continuous map, although it is no 
longer linear. The fact that this stronger continuity property also holds is crucial when showing 
that sequences of solutions to mollified equations all converge to the same limiting object. 

Remark 3.12 In the particular case where H x t happens to be a continuous function for every 
r € T (and every x € R (/ ), 7 Zf is also a continuous function and one has the identity 

(7 Zf){x) = (H X f(x))(x) . (3.6) 

This can easily be seen by noting that in this case, if p integrates to 1, then as A —» 0, the quantity 
(n X f(x))(cp x ) appearing in (13.41) converges to (U x f(x))(x). Since the right hand side converges 
to 0, this implies that ( JZf)(p x ) converges to (Il x f(x))(x), whence (13.61) follows. 

3.1 A classical result from harmonic analysis 

It is a classical result in harmonic analysis HBCDlll that the product extends naturally to C - a xC? 
into tS'(R d ) if and only if (3 > a. The reconstruction theorem yields a straightforward proof of 
the “if” part of this result: 

Theorem 3.13 There is a continuous bilinear map B : C S'( R d ) such that B(f, g ) = 

fgfor any two continuous functions f and g. 

Proof. Assume from now on that £ £ C~ a for some a > 0 and that / £ C A for some 8 > a. We 
then build a regularity structure ST in the following way. For the set A, we take A = N U (N — a) 
and for T, we set T = 1/0 W, where each one of the spaces V and W is a copy of the polynomial 
structure in d commuting variables described in Section [2] We also choose the structure group G 
as in the polynomial structure, acting simultaneously on each of the two instances. 

As before, we denote by X k the canonical basis vectors in V. We also use the suggestive 
notation “EX k ” for the corresponding basis vector in W, but we postulate that |EX fc | = a + \k\ 
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rather than |£A fc | = \k\ as would usually be the case. Given any distribution £ € C we then 
define a model (II\ T), where T is as in the canonical polynomial model, while IP acts as 

(n|x fc )(y) = (y - x) k , (n i~X k ){y) = (y - z) fc £(y), 

with the obvious abuse of notation in the second expression. It is then straightforward to verify 
that II y = H, o T xy and that the relevant analytical bounds are satisfied, so that this is indeed a 
model. 

Denote now by 'R0 the reconstruction map associated to the model (11^, T) and, for / € C ! , 
denote by F the element in 'Z> ' given by the local Taylor expansion of / of order 0 at each point. 
Note that even though the space T> 3 does in principle depend on the choice of model, in our 
situation F £ V 13 for any choice of £ since T xy is independent of £. It follows immediately from 
the definitions that the map x EF(x) belongs to so that, provided that f3 > a, one 

can apply the reconstruction operator to it. This suggests that the multiplication operator we are 
looking for can be defined as 

B(f,0 = Kt(EF) . 

By Theorem 13.91 this is a jointly continuous map from C 13 x C~ a into 6 V (R' / ), provided that 
d > a. If £ happens to be a smooth function, then it follows immediately from Remark T3. 1 2l that 
B(f, £) = /(x)£(x), so that B is indeed the requested continuous extension of the usual product. 

□ 


Remark 3.14 As a consequence of (13.41) . it follows that B: C a x -A C a . 

3.2 Products 

One of the main purposes of the theory presented here is to give a robust way to multiply distri¬ 
butions (or functions with distributions) that goes beyond the barrier illustrated by Theorem l3.13l 
Provided that our functions / distributions are represented as elements in V 7 for some model and 
regularity structure, we can multiply their “Taylor expansions” pointwise, provided that we give 
ourselves a table of multiplication on T. 

It is natural to consider products with the following properties. Here, given a regularity struc¬ 
ture, we say that a subspace V C T is a sector if it is invariant under the action of the structure 
group G and if it can furthermore be written as V’ = V a with V a C T a . 

Definition 3.15 Given a regularity structure (T, A, G) and two sectors V,V C T, a product on 
(V. V) is a bilinear map *: VxV —> T such that, for any r € V a and f € Vp, one has r*r £ T a+ p 
and such that, for any element T £ G, one has T(r x r) = Tr * Tt. 

Remark 3.16 The condition that homogeneities add up under multiplication is very natural bear¬ 
ing in mind the case of the polynomial regularity structure. The second condition is also very 
natural since it merely states that if one reexpands the product of two “polynomials” around a 
different point, one should obtain the same result as if one reexpands each factor first and then 
multiplies them together. 
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Given such a product, we can ask ourselves when the pointwise product of an element V 71 
with an element in D 72 again belongs to some D 7 . In order to answer this question, we introduce 
the notation Vf to denote those elements / G X ? 7 such that furthermore fix) G T> a for every x. 
With this notation at hand, it is not too difficult to verify that one has the following result: 

Theorem 3.17 Let f\ G Vff iV), f 2 G 'D/fiV), and let * be a product on (V, V). Then, the 
function f given by f(x ) = fi(x) * / 2 (x) belongs to Vf with 

a = ai + «2 , 7 = (71 + a 2 ) A (72 + a±) . (3.7) 

The proof of this result is straightforward and can be found in jHai!41 lHai!5ll . It is clear 
that the formula (13.71) for 7 is optimal in general as can be seen from the following two “reality 
checks”. First, consider the case of the polynomial model and take _/) G C 7 \ In this case, the 
truncated Taylor series F, for f; belong to Vff. It is clear that in this case, the product cannot be 
expected to have better regularity than 71 A 72 in general, which is indeed what (13.71) states. The 
second reality check comes from the example of Section I3T1 In this case, one has F G Vq, while 
the constant function igS belongs to V°f a so that, according to (13.71) . one expects their product 
to belong to D_~ a , which is indeed the case. 

Remark 3.18 In order to obtain robust approximation results, one would like to obtain bounds on 
the distance between /1 * / 2 and f \ * f 2 in cases where the f % belong to spaces P 7i based on some 
model (II, T) G -M, while the f, belong to spaces V li based on a different model (17, f) G .<#. 
This can also be obtained, but makes the proof slightly more lengthy. 

Remark 3.19 Even if both IZfi and lZf 2 happens to be continuous functions, this does not in 
general imply that U{fi * f 2 )(x) = ('/7,/'i )(x) (77./ 2 )(.i')! 

For example, fix k < 0 and consider the regularity structure given by A = {—2k, —k, 0), with 
each T a being a copy of R given by T_ nK = (S n ). We furthermore take for G the trivial group. 
This regularity structure comes with an obvious product by setting T rn * T n = 'E m+n provided 
that m + n < 2. Then a perfectly valid model is 

(IL^Xy) = 1 , (n^Xy) = 0 , (n x " 2 )(y) = c , (3.8) 

for which, setting F{x) = f{x)E° + (Xx)S 1 , one has (7 ZF){x) = f(x), but {1ZF 2 ){x) = f 2 (x) + 
cg 2 (x). This flexibility is crucial since the “naive” definition of the product is usually broken by 
renormalisation, as we will see in Section [43] below. 

3.3 Schauder estimates and admissible models 

One of the reasons why the theory of regularity structures is very successful at providing de¬ 
tailed descriptions of the small-scale features of solutions to semilinear (S)PDEs is that it comes 
with very sharp Schauder estimates. Recall that the classical Schauder estimates state that if 
K : R d -)■ R is a kernel that is smooth everywhere, except for a singularity at the origin that is 
(approximately) homogeneous of degree /3 — d for some (5 > 0, then the operator / 1 —>• K * / 
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maps C a into C a+/3 for every a € R, except for those values for which a + (3 € N. (See for 
example IISim97ll .) 

It turns out that similar Schauder estimates hold in the context of general regularity structures 
in the sense that it is in general possible to build an operator K : T)' ! -a V' r+l3 with the property 
that TZJCf = K * TZf. Of course, such a statement can only be true if our regularity structure 
contains not only the objects necessary to describe TZf up to order 7 , but also those required to 
describe I\ * 1Zf up to order 7 + (3. What are these objects? At this stage, it might be useful to 
reflect on the effect of the convolution of a singular function (or distribution) with K. 

Let us assume for a moment that / is also smooth everywhere, except at some point xq. It 
is then straightforward to convince ourselves that K * f is also smooth everywhere, except at 
x,(j. Indeed, for any 6 > 0, we can write K = K$ + /x|, where Kg is supported in a ball of 
radius 5 around 0 and Kg is a smooth function. Similarly, we can decompose / as / = f$ + /?, 
where fs is supported in a 5-ball around xq and /| is smooth. Since the convolution of a smooth 
function with an arbitrary distribution is smooth, it follows that the only non-smooth component 
of K * f is given by Kg * fs, which is supported in a ball of radius 25 around xq . Since 5 was 
arbitrary, the statement follows. By linearity, this strongly suggests that the local structure of the 
singularities of K * / can be described completely by only using knowledge on the local structure 
of the singularities of /. It also suggests that the “singular part” of the operator JC should be local, 
with the non-local parts of JC only contributing to the “regular part”. 

This discussion suggests that we certainly need the following ingredients to build an operator 
JC with the desired properties: 

• The canonical polynomial structure should be part of our regularity structure in order to be 
able to describe the “regular parts”. 

• We should be given an “abstract integration operator” X on T which describes how the 
“singular parts” of IZf transform under convolution by K. 

• We should restrict ourselves to models which are “compatible” with the action of X in 
the sense that the behaviour of II 7 Xt should relate in a suitable way to the behaviour of 
K * II x t near x. 

One way to implement these ingredients is to assume first that our model space T contains abstract 
polynomials in the following sense. 

Assumption 3.20 There exists a sector T C T isomorphic to the space of abstract polynomials 
in d commuting variables. In other words, T a 0 if and only if a € N, and one can find basis 
vectors X k ofT j/,| such that every element I G G acts on T by TX k = ( X — h) k for some h € R . 

Furthermore, we assume that there exists an abstract integration operator X with the following 
properties. 

Assumption 3.21 There exists a linear map X: T —> T such that IT n c T a+ a, such that IT = 
0, and such that, for every T G G and r € X, one has 


TXt - XTt € X . 


(3.9) 
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Finally, we want to consider models that are compatible with this structure for a given kernel 
K. For this, we first make precise what we mean exactly when we said that K is approximately 
homogeneous of degree (5 — d. 

Assumption 3.22 One can write K = ^ n>0 w ^ere each of the kernels I \ n : R (/ —» R is 
smooth and compactly supported in a ball of radius 2~ n around the origin. Furthermore, we 
assume that for every multiindex k, one has a constant C such that the bound 

sup \D k K n (x)\ < C2 n(d ~ f}+ |fc|) , (3.10) 

X 


holds uniformly in n. Finally, we assume that f K n (x)P(x) dx = 0 for every polynomial P of 
degree at most N, for some sufficiently large value of N. 

Remark 3.23 It turns out that in order to define the operator /C on V 1 , we will need K to annihi¬ 
late polynomials of degree N for some N > 7 + (3. 

Remark 3.24 The last assumption may appear to be extremely stringent at first sight. In practice, 
this turns out not to be a problem at all. Say for example that we want to define an operator that 
represents convolution with P, the fundamental solution to the heat equation. Then, P can be 
decomposed into a sum of terms satisfying the bound (13.101) with (3 = 2, but it does of course not 
annihilate generic polynomials and it is not supported in the ball of radius 1 . 

However, for any fixed value of N > 0, it is straightforward to decompose P as P = K + K, 
where the kernel K is compactly supported and satisfies all of the properties mentioned above, 
and the kernel K is smooth. Lifting the convolution with K to an operator from V 7 —>• X > 7+/3 
(actually to IT for any 7 > 0 ) is straightforward, so that we have reduced our problem to that of 
constructing an operator describing the convolution by K. 

Given such a kernel K, we can now make precise what we meant earlier when we said that 
the models under consideration should be compatible with the kernel K. 

Definition 3.25 Given a kernel K as in Assumption I3.22l and a regularity structure S satisfying 
Assumptions I3.20l and l3.2 1 1 we say that a model (n, F) is admissible if the identities 

(n x X k )(y) = (y - x) k , U x 1t = K *U X T-U x J(x)t , (3.11) 

hold for every r e T in the domain of X. Here, J(x): T -A T is the linear map given on 
homogeneous elements by 

J(x)T= ^2 ~JT [ D {k) K{x-y)(n x r)(dy) . 

\k\<\r\+p J 


(3.12) 
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Remark 3.26 While K*£ is well-defined for any distribution £, it is not so clear a priori whether 
the operator J(x) given in (13.121) is also well-defined. It turns out that the axioms of a model do 
ensure that this is the case. The correct way of interpreting (13.121 ) is by 

J(X)T= J2 ^(n x T )( D(k)K n( x - ■)) ■ 

|fc|<|r|+/3n>0 

The scaling properties of the K n ensure that 2^~^ n K n (x — •) is of the form cp\ with 
c p € B r (for arbitrary r), c ~ 1 and A ~ 2~ n . As a consequence of (13.21) . one then has 

\(U x r)(D^K n (x - -))| < 2^~^ n , 

so that this expression is indeed summable as long as \k\ < |r| + 3. 


Remark 3.27 The above definition of an admissible model dovetails very nicely with our axioms 
defining a general model. Indeed, starting from any regularity structure ST, any model (II, T) for 
■7, and a kernel K satisfying Assumption 13.221 it is usually possible to build a larger regularity 
structure Sf containing ,7 (in the “obvious” sense that T C T and the action of Cl on T is com¬ 
patible with that of G) and endowed with an abstract integration map X , as well as an admissible 
model (II, T) on -7 which reduces to (II, T) when restricted to T. See I Hai 14 1 for more details. 

The only exception to this rule arises when the original structure T contains some homoge¬ 
neous element r which does not represent a polynomial and which is such that |r| +/3 E N. Since 
the bounds appearing both in the definition of a model and in Assumption 13.221 are only upper 
bounds, it is in practice easy to exclude such a situation by slightly tweaking the definition of 
either the exponent /3 or of the original regularity structure ,7. 

With all of these definitions in place, we can finally build the operator /C: 77 7 —>• 73 7-7 ' 
announced at the beginning of this section. Recalling the definition of J from (13.121) . we set 


(/C/)(.T) = If{x) + J{x)f(x) + (A ff)(x) , (3.13) 

where the operator A f is given by 

(Aff)ix) = fr / D{k)R ( x - V) W - n X fix))idy) . (3.14) 

|fc|<7+/3 

Note first that thanks to the reconstruction theorem, it is possible to verify that the right hand side 
of (13.14b does indeed make sense for every / € D 7 in virtually the same way as in Remark 13.261 
One has: 


Theorem 3.28 Let K be a kernel satisfying Assumption 13.221 let L7 = (A, T, G) be a regularity 
structure satisfying Assumptions 13.201 and [071 and let (II, T) be an admissible model for XL. 
Then, for every f E D 7 with 7 € (0, A^ — j3) and 7 + /3 (f N, the function ICf defined in ( 13.731 ) 
belongs to 72 7+7 and satisfies 72/C/ = K * 72/. 


























20 


Application of the theory to the dynamical <b| model 


Proof. The complete proof of this result can be found in IlHai 14(1 and will not be given here. Let 
us simply show that one has indeed 7 ZK,f = K * IZf in the particular case when our model 
consists of continuous functions so that Remark [3712] applies. In this case, one has 

(KICf)(x) = (n X (lf(x) + J{x)f(x))){x) + (U x {Aff)(x))(x) . 

As a consequence of (13.111) . the first term appearing in the right hand side of this expression is 
given by 

(n x (Zf(x) + j(x)f(x)))(x) = (k * n X f(x))(x). 

On the other hand, the only term contributing to the second term is the one with k = 0 (which is 
always present since 7 > 0 by assumption) which then yields 

(n r (A ff)(x))(x) = J K(x - y) (Jlf - U x f(x))(dy) . 

Adding both of these terms, we see that the expression ( K * Il x f(x))(x) cancels, leaving us with 
the desired result. □ 


4 Application of the theory to the dynamical $3 model 


We now sketch how the theory of regularity structures can be used to obtain the kind of conver¬ 
gence result stated in Theorem ll.il We will only focus on the dynamical <I > 4 model and ignore the 
corresponding result for the KPZ equation. First of all, we note that while our solution ( I> will be 
a space-time distribution (or rather an element of XT' for some regularity structure with a model 
over R 4 ), the ‘ Time” direction has a different scaling behaviour from the three “space” directions. 
As a consequence, it turns out to be effective to slightly change our definition of “localised test 
functions” by setting 

¥>&,*)(*, V) = A ~ 5 p(\~ 2 (t - s), A -1 (y - x)) . 


Accordingly, the “effective dimension” of our space-time is actually 5, rather than 4. The theory 
presented above extends mutatis mutandis to this setting. (Note in particular that when consider¬ 
ing the degree of a regular monomial, powers of the time variable should now be counted double.) 
With this way of measuring regularity, space-time white noise belongs to C~ a for every a > |. 


4.1 Construction of the associated regularity structure 

Our first step is to build a regularity structure that is sufficiently large to allow to reformulate (IT 1 1 
as a fixed point in V 7 for some 7 > 0 . Denoting by P the heat kernel, we can write the solution 
to (ho with initial condition Tq as 


T = P*1 4> o(C-T 3 ) + PTo, (4.1) 

where * denotes space-time convolution and where we denote by P4>o the solution to the heat 
equation with initial condition <I>o. Here, l t> o denotes the indicator function of the set {(t. x) : 
t > 0}. In order to have a chance of fitting this into the framework described above, we first 
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decompose the heat kernel P as P = K + K, where I\ satisfies all of the assumptions of 
Section 13.31 with /3 = 2 and the remainder K is smooth, see Remark 13.241 For any regularity 
structure containing the usual Taylor polynomials and equipped with an admissible model, is 
straightforward to associate to K an operator K ,: 'D 7 —>• T> 7+ - via 

(£/)(*)= E fr (D {k) K*nf)(z), 

|fc|<7+2 

where z denotes a space-time point. Similarly, the harmonic extension of <I>o can be lifted to an 
element in V 7 (for any fixed 7 > 0) which we denote again by P4>o by considering its Taylor 
expansion around every space-time point. At this stage, we note that we actually cheated a little: 
while P<ho is smooth in {(£, x) : t > 0, x € T 3 } and vanishes when t < 0, it is of course singular 
on the time-0 hyperplane {(0, x) : x E T 3 }. Similarly, we have the problem that the function 
l t> o does not belong to any 'D 1 . Both of these problems can be cured at once by introducing 
weighted versions of the spaces V 7 allowing for singularities on a given hyperplane. A precise 
definition of these spaces and their behaviour under multiplication and the action of the integral 
operator TC can be found in I lHai 1411 . For the puipose of the informal discussion given here, we 
will simply ignore this problem. 

As in Section [Til we furthermore introduce a new symbol H which will be used to represent 
the noise £. This suggests that the formulation of ( l f I> 1 h in a suitable (weighted) space T) 7 should 
be of the form 

<S> = Vl t>0 (Z-<S> 3 ) + P<S> 0 , (4-2) 

where we set V = 1C + JC. In view of (13.131) . for t > 0 this equation is of the type 

4> = Z(H - <F 3 )+ (...) , (4.3) 

where the terms (...) consist of functions that take values in the subspace T C T spanned by 
the regular Taylor monomials X k . In order to build a regularity structure in which (14.31 ) can be 
formulated, it is natural to start with the structure given by abstract polynomials (again with the 
parabolic scaling which causes the abstract “time” variable to have homogeneity 2 rather than 1), 
and to add a symbol E to it which we postulate to have homogeneity — § , where we denote by 
a~ an exponent strictly smaller than, but arbitrarily close to, the value a. 

We then simply add to T all of the formal expressions that an application of the right hand side 
of (14.31) can generate for the description of 4>, 4' 2 , and 4> 3 . The homogeneity of a given expression 
is completely determined by the rules \1t\ = |r| + 2 and |rr| = |t| + |r|. More precisely, we 
consider a collection U of formal expressions which is the smallest collection containing X k and 
X(S), and such that 

ti, 72, r 3 € U =A X(tit 2 T3) € U , (4.4) 

where it is understood that Z(X k ) = 0 for every multiindex k. We then set 

W = {£} U {tit 2 t 3 : n € U] , (4.5) 

and we define our space T as the set of all linear combinations of elements in W. (Note that since 
1 € U, one does in particular have U C W.) Naturally, T a consists of those linear combinations 
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that only involve elements in W that are of homogeneity a. It is not too difficult to convince 
oneself that, for every a € R, W contains only finitely many elements of homogeneity less than 
a, so that each T a is finite-dimensional. 

In order to simplify expressions later, we will use the following shorthand graphical notation 
for elements of W. For H, we simply draw a dot. The integration map is then represented by a 
downfacing line and the multiplication of symbols is obtained by joining them at the root. For 
example, we have 

Z(~) = T, Z(S) 3 =V, 1(E)I(1(E) 3 ) = 'X' ■ 

Symbols containing factors of X have no particular graphical representation, so we will for ex¬ 
ample write XiZ(E) 2 = XX- With this notation, the space T is given by 

T = (S, % V, T , ' % , X$, 1,?, , 

where we ordered symbols in increasing order of homogeneity and used (•) to denote the linear 
span. Given any sufficiently regular function £ (say a continuous space-time function), there is 
then a canonical way of lifting £ to a model _£?(£) = (II, T) for T by setting 

(n x H )(y) = ay), (n X x k )( y ) = ( y - x) k , 

and then recursively by 

(U x Tf)(y) = (n x r)(y) • (II x f)(y) , dUb) 

as well as (13.111) . (Note that here we used x and y as notations for generic space-time points in 
order to keep notations compact.) 

4.2 Construction of the structure group 

So far, we have only described the vector space T arising in the construction of a regularity 
structure suitable for the analysis of the dynamical d> : ( model, but we have not yet described the 
corresponding structure group G. The reason why a non-trivial group G is needed is that in (13.111) . 
as soon as |Z(r)| > 0, II 2 Z(t) depends non-trivially on the base point z, so that G is needed in 
order to be able to enforce the algebraic relation II 2 = II-T^ of Definition 13.31 which should be 
interpreted as the action of “reexpanding” a “Taylor series” around a different point. 

In our case, in view of (13.111) . the coefficients of these reexpansions will naturally be some 
polynomials in x and in the expressions appearing in (13.121) . This suggests that we should define 
a space T + whose basis vectors consist of formal expressions of the type 

N 

X k n^(T t ), (4.7) 

i— 1 

where N is an arbitrary but finite number, the r, are basis elements of T different from the Taylor 
monomials X k , and the £ t are d-dimensional multiindices satisfying j(j;| < |r,| + 2. (The last 
bound is a reflection of the restriction of the summands in (13.121) with /3 = 2.) The space T + also 
admits a natural graded structure T + = 0 T+ by setting 

X fc | = \k\ , 


\Se(r)\ = |r| + 2 - \£\ , 
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and by postulating that the degree of a product is the sum of the degrees. Unlike in the case of T 
however, elements of T + all have strictly positive homogeneity, except for the empty product 1 
which we postulate to have degree 0. 

To any given admissible model (II, T), it is then natural to associate linear maps f x : T + —>• R 
by fx(X k ) = ( -x ) k , f x {acr) = f x (a)f x (a), and 

fMr) = ~ Y, [ D(i+k> K(x - y ) (n xT ){dy) . (4.8) 

\k+i\<\r\+2 ' J 

The minus signs are here purely by convention and serve to make some expressions simpler later 
onJl It then turns out that it is possible to build a linear map A: T -A T <g> T + such that if we 
define F x : T —> T by 

F x t = (id <g> /i)At , (4.9) 

where I denotes the identity operator on T, then these maps are invertible and, given an admissible 
model (II, T), 1 I F~ 1 is independent of x. If this is the case, then one can then recover the maps 

r X y by 

T X y = F~ 1 oF y . (4.10) 

The '‘correct” definition for the map A compatible with (13.111) and (14.81) is then given by 

A1 = 1 ® 1, AH = E ® 1 , AX = X®X , 
and then recursively by 

A(rr) = (Ar)(Af) , A X(f) = (1 <g> id)Ar + —— ® j4(t) . 

k 

Remark 4.1 This definition shows that the convention T{X k ) = 0 is compatible with the con¬ 
vention = 0. 

The identities (14.91 ) and (14.101) and the definition of f x suggest that one should take for G the 
set of all linear maps of the type 

Tjt = (id <g) /)Ar , 

where / is a multiplicative linear functional on T + . Before we show that G does indeed form a 
group, we argue that if Yl x is defined recursively as above and F x is given by (14.91) . then there 
exists a single linear map II: T —>• S' such that 

Il^r = IIFjT Vt S T . (4.11) 

We can simply exhibit II explicitly. Set 

(nH)(s) = ^(x ), (IIA fc )(x) = x k , (4.12) 


2 This definition differs from that in [ Hail41 by a simple change of basis in T + and, while the inclusion of the sum 
over k in ( 14.8b may not appear very natural here, it leads to more natural expressions for the maps A and A + below. 
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and then recursively 

rirf = IIt • Ilf , IlXr = K * IIr . (4.13) 

Note that this is very similar to the definition of Jzf(£), with the notable exception that (13.1 II) is 
replaced by the more “natural” identity IIXt = K * Ilr. It is then a simple exercise in binomial 
identities to show that indeed Tl x X k = HF x X k and that, assuming that (14.111) holds for some t 
and that f x is given by (14.81) . one also has I \,X(t) = UF x I(t). Finally, since all relevant objects 
are multiplicative, one can see that if (14.1 II) holds for symbols t and t , then it must also hold for 
their product rf. 

We now argue that G as defined above actually forms a group, so that in particular the maps 
F x are invertible. Indeed, if we define a map A + : T + —> T + ® T + very similarly to A by 

A+l = 1®1, A + X = X®X , 


and then recursively by 

A + (aa) = (A + cr)(A + a) , A + J^(t) = (J^ <g> id)Ar + ^ < 8 > J?e+ k(r) , (4.14) 

k 


then it can be verified that this map intertwines with A via the relations 

(A <g) I)A = (I <g> A + )A , (A + <g) /)A + = (I (g) A + )A + . (4.15) 


We then define a product o on the space of linear functionals /: T + — > R by 

(/ ° 9)(cr) = (/ ® £f)A + <7 . 


If we furthermore denote by Tf the operator T associated to any such linear functional as in (14.91 ). 
the first identitv of (14.151) vields the identity T/Fg — T/og- The first identity of (14.141 ) furthermore 
ensures that if / and g are both multiplicative in the sense that f(cra) = f{a)f{a), then / o r/ is 
again multiplicative. It also turns out that every multiplicative linear functional / admits a unique 
inverse / _1 = Af for some linear map A: T — » T such that / -1 o / = / o / _1 = e, where 
e: T + —> R maps every basis vector of the form (14.71 ) to zero, except for e(l) = 1. The element 
e is neutral in the sense that T e is the identity operator. 

It is now natural to define the structure group G associated to T as the set of all multiplicative 
linear functionals on T + , acting on T via ( 14.91 ). Furthermore, for any admissible model, one has 
the identity 


r — F~ 1 F — V 

1 xy — r x r y — L 'Txy 


7 xy — f x ° f'i 


Returning to the relation between IT, : and II, we showed actually more, namely that the 
knowledge of II and the knowledge of (II, T) are equivalent. Inedeed, one the one hand one has 
II = IT7^7 1 and the map F x can be recovered from 11, by (14.81) and (14.91) . On the other hand 
however, one also has of course 11, = IITjr 1 and, if we equip T with an adequate recursive struc¬ 
ture (denote by |r|j the number of times X appears in the symbol r), then it is possible to show 
that the determination of F ? 7 1 r (and therefore of I f,.r) only requires knowledge of f x (Xk(a)) 
(and therefore of n r <r) for symbols a with \a\j < \t\j — 1 . 
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Furthermore, the translation (II, T) AA II outlined above works for any admissible model and 
does not at all rely on the fact that it was built by lifting a continuous function. In particular, it 
does not rely on the fact that II X and II are multiplicative. In the general case, the first identity 
in (14.131) may then of course fail to be true, even if Ilr happens to be a continuous function for 
every r € T. The only reason why our definition of an admissible model does not simply consist 
of the single map II is that there seems to be no simple way of describing the topology given by 
Definition 13 .3 1 in terms of II. 

4.3 Renormalisation of the dynamical <I> 4 model 
Combining Theorem 13.171 and Theorem 13.281 we see that the map 

i-a V(E - <b 3 ), 

is a continuous map from V 7 into itself, provided that 7 > 1 + 2k. Actually, one obtains a 
regularity improvement in the sense that it maps D 7 into D 7 for some 7 > 7. In way reminiscent 
of usual parabolic PDE techniques, one can leverage this regularity improvement to build spaces 
D 7,,? (essentially weighted versions of the spaces D 7 which allow coefficients to become singular 
near times 0) such that the following holds. 

Theorem 4.2 For every a > — | there exist exponents 7 and // such that, for every <f>o € C a and 
every admissible model, ( 14.21 ) admits a unique local solution in Furthermore, this solution 
depends continuously on both <f>o and the underlying model. 


Remark 4.3 One hits some minor technical difficulties in the actual definition of local solutions 
to (14.21 ). but these are resolved in Mail411 . One can also show that solutions can be continued 
either forever or up to a time where the C a solution of the solution explodes. In this particular 
instance, one can actually show that solutions do not blow up, as shown in IIMW15 1. 

Given a model of the type «S?(0 constructed as in the previous subsection for a continuous 
space-time function 0 it then follows from ( 14.6bl) and the admissibility of 2z?(0 that the associated 
reconstruction operator satisfies the properties 

KlCf = K*Kf, n(fg) =Kf-Kg, 


as long of course as all the functions to which 72 is applied belong to V' ! for some 7 > 0 so that 
72/ is uniquely defined by (13.41) . As a consequence, applying the reconstruction operator 72 to 
both sides of (14.21 ). we see that if <h solves (14.21 ) then 72<I> solves the integral equation (14.11) . which 
thus yields the unique classical solution to d<f> 4 l) . Note however that this identification of solutions 
to (14.21) with solutions to (|E> relies crucially on the identity 7244> :! ) = (72 <b) 3 . This identity has 


no reason whatsoever to hold for a generic admissible model! In particular, while (14.21) makes 
sense for any admissible model, we do not know in general whether its solutions still solve a local 
PDE. 

At this stage, the situation is as follows. For any continuous realisation £ of the driving noise, 
we have factored the solution map (4>o, 0 -A 'F associated to d ( I> l into maps 


($o,0-> ($o,^(0) ^<&^72<b, 
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where the middle arrow corresponds to the solution to (14.21) built in Theorem 14.21 The advantage 
of such a factorisation is that the last two arrows yield continuous maps, even in topologies suffi¬ 
ciently weak to be able to describe driving noise having the lack of regularity of space-time white 
noise. The only arrow that isn’t continuous in such a weak topology is the first one. We also 
hope that the reader is convinced that a similar construction can be performed for a very large 
class of semilinear stochastic PDEs. In particular, the KPZ equation can also be analysed in this 
framework. 

Given this construction, one is lead naturally to the following question: given a sequence £ e 
of “natural” regularisations of space-time white noise, do the lifts j£?(£ e ) converge in probably in 
a suitable space of admissible models? Unfortunately, unlike in the case of the theory of rough 
paths where this is very often the case (but see IHai 1311 for an example where it fails there too), 
the answer to this question in the context of SPDEs is often an emphatic no. Indeed, if it were the 
case for the dynamical <I >3 model, then one could have chosen the constant C £ to be independent 
of e in (11.21) . which is certainly not the case. 

The way in which we are able to circumvent the fact that j£f(£ e ) does not converge to a limiting 
model as e —>• 0 is to consider instead a sequence of renormalised models. The main idea is to 
exploit the fact that our definition of a model does not impose the identity (I4.6bl) . even in situations 
where £ itself happens to be a continuous function. One question that then imposes itself is: what 
are the natural ways of “deforming” the usual product which still lead to an admissible model? 
It turns out that the regularity structure whose construction was sketched above comes equipped 
with a natural finite-dimensional group of continuous transformations 91 on its space of admissible 
models (henceforth called the “renormalisation group”), which essentially amounts to the space 
of all natural deformations of the product. It then turns out that even though «£?(£ e ) does not 
converge, it is possible to find a sequence M e of elements in 91 such that the sequence M e 2 §?(£ e ) 
converges to a limiting model (IT, T). Unfortunately, the elements M e no not preserve the image 
of S£ in the space of admissible models. As a consequence, when solving the fixed point map (14.21 ) 
with respect to the model M e j£?(£ e ) and inserting the solution into the reconstruction operator, it 
is not clear a priori that the resulting function (or distribution) can again be interpreted as the 
solution to some modified PDE. It turns out however that this is again the case and the modified 
equation is precisely given by (II.21) . where C £ is some linear combination of the two constants 
appearing in the description of M e . 

There are now three questions that remain to be answered: 

1. How does one construct the renormalisation group 91? 

2. How does one derive the new equation obtained when renormalising a model? 

3. What is the right choice of M e ensuring that the renormalised models converge? 

4.4 The renormalisation group 

In order to build the group 91, it turns out to be appropriate to describe its action first at the level 
of II rather than at the level of (n, T). At this stage we note that if £ happens to be a stationary 
stochastic process and II is built from £ by (14.121 ) and (14.131 ). then IIt is also a stationary stochas¬ 
tic process for every t € T. In order to define 91, it is natural to consider only transformations of 
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the space of admissible models that preserve this property. Since we are not in general allowed to 
multiply components of II, the only remaining operation is to form linear combinations between 
them. It is therefore natural to describe elements of 91 by linear maps M: T —> T and to postulate 
their action on admissible models by II i-a II M with 

n m t = n Mt . 


It is not clear a priori whether given such a map M and an admissible model (II, T) there is a co¬ 
herent way of building a new model , T M ) such that II A/ is the map associated to (II A/ , T M ) 
as above. It turns out that one has the following statement: 

Proposition 4.4 In the above context, for every linear map M: T -A T commuting with X and 
multiplication by X k , there exist unique linear maps : T —>• T <g) T + and A / : T + —>• 
T + <g) T + such that if we set 

n “ r = ( n -r ® fx )£ s M T , ' 7 xy ( a ) = (7 xy ® fx)(T , 

then II A/ satisfies again ( D. // D and the identity 11 A/ F A ^ = 11 A/ . 

At this stage it may look like any linear map M: T —>• T commuting with X and multiplica¬ 
tion by X k yields a transformation on the space of admissible models by Proposition 14.41 This 
however is not true since we have completely disregarded the analytical bounds that every model 
has to satisfy. It is clear from Definition 13 .3 1 that these are satisfied in general if and only if, for 
every symbol r, n^r is a linear combination of the I \ x t only involving symbols t with |r| > |r|. 
This suggests the following definition. 


Definition 4.5 The renormalisation group 91 consists of the set of linear maps M: T —» T com¬ 
muting with X and with multiplication by X k , such that for r G T (t and o € Tf, one has 

A M r-r®leT >a ®I + , A m ct - a ® 1 G T+ a ® T+ . (4.16) 

Its action on the space of admissible models is given by Proposition 14.41 


Remark 4.6 It turns out that the second condition of (14.161) is actually a consequence of the first 
one, see the appendix in | [HQ15fl . 


4.5 The renormalised equations 

In the case of the dynamical $3 model considered in this article, it turns out that in general 
(i.e. if we also want to cover the situation mentioned in the introduction where £ is approx¬ 
imated by a rescaled non-Gaussian stationary process) we need a five-parameter subgroup of 
94 to renormalise the equations. More precisely, we consider elements M € 94 of the form 
M = exp(— CiLf), where the generators L, are determined by the substitution rules 


L \: V 1—> 1 , L2 '■ V i-T 1 , T3: 1 ^ 1 , T4: 1—> 1 , T5 : 'f/ 1—> 1 . 
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This should be understood in the sense that if r is an arbitrary formal expression, then L\t is the 
sum of all formal expressions obtained from r by performing a substitution of the type V i-a 1, 
and similarly for the other L,. For example, one has 

LfV = 31 , = "l , L$fy = 31 , Lil = 0 . 

The convention Z{ 1) = 0 also suggests that one should set for example L- 2 <f = 0, since the 
symbol ■!/, which is what the substitution ?ol creates, would contain a factor of T(\). It is easy 
to see that all of the Li commute with each other so that, as an abstract group, the transformations 
we consider form a copy of R 5 with addition as its group operation. This is not a general fact 
however. One then has the following result: 

Proposition 4.7 The linear maps M of the type just described belong to 93. Furthermore, if 
(II, T) is an admissible model such that I l x r is a continuous function for every t € T, then one 
has the identity 

(nf r)(x) = (U x Mt)(x) . (4.17) 

Remark 4.8 Note that it it is the same value x that appears twice on each side of (14.171) . It is 
in fact not the case that one has H^V = U x Mt\ However, the identity (14.171 ) is all we need to 
derive the renormalised equations. 

It is now rather straightforward to show the following: 

Proposition 4.9 Let M be as above and let (II Y T M ) be the model obtained by acting with M 
on the canonical model (n,T) = for some smooth function £. Let furthermore <f> be the 

solution to ( 14.21 ) with respect to the model (I I u . T ;U ). Then, the function u(t, x) = (72 A/< I>)(f, x) 
solves the equation 

dtu = A u — u 3 + (3C*i — 9 C 3 — QCf)u — ( C 2 + 3 C 4 ) + £ . 

Proof By Theorem l3.171 it turns out that (14.21) can be solved locally in V 7 (or rather a weighted 
version of this space taking into account possible blowup near time 0 ) as soon as 7 is a little bit 
greater than 1. Therefore, we only need to keep track of its solution <I> up to terms of homogeneity 
1. By repeatedly applying the identity (14.31) . we see that the solution <b is necessarily of the form 

$ = T + ^1-T-3^Y+(V99,A), (4.18) 

for some real-valued function p and some Revalued function Vp. We emphasise that Vp is 
treated as an independent function here, we certainly do not suggest that the function p is differ¬ 
entiable! Our notation is only by analogy with the classical Taylor expansion. Similarly, the right 
hand side of the equation is given up to order 0 by 

= S-T'-3v?V + 3^-3vj 2 ! + 6^Y + 9^\^-3(Vc^,VA) -vj 3 1 . (4.19) 

Combining this with the definition of M, it is straightforward to see that, modulo terms of strictly 
positive homogeneity, one has 

M(E - 4> 3 ) = E - 


(M$) 3 + 3Ci T + C 2 1 + 3C!pl - 9C 3 T - 3C 4 1 
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— 6 C 5 T - 6 C 5 ^1 - 9C 3 ^1 

= S - (M$) 3 + (3Ci - 9C 3 - 6C 5 )M<t> + (C 2 - 3C 4 )1 . 

Combining this with (14.171) and applying as before the reconstruction operator 1Z M to both sides 
of 63, the claim now follows at once. □ 

4.6 Convergence of the renormalised models for Gaussian approximations 

We now argue that if = g £ * £ as in Theorem ll.il then one expects to be able to find constants 
C[ £) and C 3 (and set C <£> = 0 for i € { 2 ,4, 5}) such that the sequence of renormalised models 
M e 2 z?(£ e ) defined as in the previous subsection converges to a limiting model. Instead of con¬ 
sidering the actual sequence of models, we only consider the sequence of stationary processes 
II r := II £ M e r, where II £ is associated to (TF,r e ) = 2 zf(£ e ) as before. Since there are gen¬ 
eral arguments available to deal with all the expressions r of positive homogeneity, we restrict 
ourselves to those of negative homogeneity which, leaving out H which is easy to treat, are given 
by 

V, V, 1, XiV. 

Remark 4.10 Even if we can show that ITr converges weakly to some limit as e —>• 0, this 
does not necessarily imply that the corresponding sequence of models converges in their natural 
topology. We will not try to address convergence in the correct topology here, although this is a 
non-trivial problem. 

For this section, some elementary notions from the theory of Wiener chaos expansions are 
required, but we will tty to hide this as much as possible. Recall that II s T = K * = K e * £, 

where the kernel K e is given by K e = K * g e . This shows that, for e > 0, one has 

(ITV)O) = (K * £ £ ){zf = jJ K e {z - zi)K s (z - z 2 ) €(zi){(z 2 ) dz 1 dz 2 . 

Here, we make use of the fact that, for any two distributions ( and rj, the product £ • 77 is well- 

defined as a distribution on the product space. It is only when one tries to define a pointwise 
product yielding again a distribution on the same space that one runs into trouble. Similar but 
more complicated expressions can be found for any formal expression r. This naturally leads to 
the study of random variables of the type 

4 (Z) = /'"/ f( z i’--'-> z k)€( z i)-’-€( z k)dz 1 ---dzk ■ (4.20) 

Ideally, one would hope to have an Ito isometry of the type £/;.(/)4(. 7 ) = (/ sym , /y sym ), where 
(•, •) denotes the L 2 -scalar product and / sym denotes the synmietrisation of /. This would then 
allow to extend (14.201 ) from smooth test functions to all functions / € L 2 . It is unfortunately not 
the case that one has such an isometry. Instead, one should first replace the products in (14.201) by 
Wick products, which are obtained by considering all possible contractions of the type 


t(zi)£(zj) i-A :£(zi)£(zj): + 6(z t - Zj) . 
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For example, the distribution :£(zj) £(Zj) £(z k ): is defined by the identity 

£0i)£0j)£0fc) = ^(zi)^(zjX(z k ): + £( Zi)6(zj - z k ) + £(zj)5(z k - zf) + £(z k )8(zi - zf) . 

(See Section l5Tl below for a more general definition which covers Gaussian random variables as 
a special case.) If we then set 

4(/) = J - J f(z l, ■ • •, Zk) :£4i)''' £(zkY dz \ ■ ■ ■ dz k , 

which is well-defined for all smooth test functions /, one recovers indeed the Ito isometry 

E4(/)4(s0 = (f sym ,g sym ) • (4-21) 

We refer to IINua95l for a more thorough description of this construction, which also goes under 
the name of Wiener chaos, with random variables of the type I k (f) said to belong to the chaos of 
fcth order. While chaoses of different order arc not independent, they are orthogonal in the sense 

that E4(/)44) = 0 if k^t 

One very nice property is that one has equivalence of moments in the sense that, for every 
k > 0 and p > 0 there exists a constant C k .p such that 

E|4(/)r < C k jr m \\ P L2 < C k JfW P L2 , (4.22) 

where the second bound comes from the fact that symmetrisation is a contraction in L 2 . In other 
words, the pth moment of a random variable belonging to a Wiener chaos of fixed order can be 
bounded by the corresponding power of its second moment. The reason why such a bound is very 
useful is the following Kolmogorov-type result: 

Theorem 4.11 Let £ he a distribution-valued random variable such that there exists a < 0 such 
that, for every p > 0 , the bound 

E|£(v£)r < A°*\ (4.23) 

holds uniformly over ip € B r for some r > |o:|, all A € (0,1], and locally uniformly in x. Then, 
for every /3 < a, there exists a -valued version of f 

Remark 4.12 Here, we say that £ is a version of £ if, for every smooth test function ip, £(</?) = 
£(<£>) almost surely. 

Remark 4.13 There is an analogous result allowing to show the convergence of a sequence of 
models to a limiting model in the topology given by Definition 13.31 see I Hail4l Thm 10.7] for 
more details. 

Remark 4.14 As usual for Kolmogorov-type results, one can trade integrability for regularity in 
the sense that if (14.231) only holds for some fixed value p > 2, then we can only conclude that £ 

has a version in some (4 for (5 < a — -, where d denotes the scaling dimension of the underlying 

space. In our particular case one has d = 5 since we consider space-time distributions and “time 
counts double”. 
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Returning to our problem, we first argue that it should be possible to choose M in such a way 
that II V converges to a limit as £ -A 0. The above considerations suggest that one should rewrite 
n V as 

(n s V)(z) = (. K * ie){zf = J j K e (z - zi)K e (z - z 2 ) :£(Ai) €(z 2 ): dz x dz 2 + C £ , (4.24) 


where the constant C E is given by 


C e = J K^(zi) dz i 


K £ (z — z\) dz\ . 


(4.25) 


At this stage, it is convenient to introduce a graphical notation “a la Feynman” for these multi¬ 
ple integrals since expressions otherwise rapidly became unwieldy. Similarly to I Hai 14 HP 14 , 
IHS151 . we denote dummy integration variables by black dots, we write • for the distinguished 
variable z, O for a Wick factor of the type £(zi), -> for the kernel K s evaluated at the dif¬ 

ference between the variables representing its two endpoints, and we implicitly assume that all 
variables except for • are integrated out. With this notation, (14.241) and (14.251) can be expressed in 
a much friendlier and more compact way as 


nv 


o o 
\ / 

x ' 4- 

\ / i 




(4.26) 


This notation is also compatible with the graphical notation we are using for the various symbols: 
the graphical notation for II s r is essentially the same as that for t but, as a consequence of 
the definition of Wick products, one should sum over all possible graphs obtained by pairwise 
contractions of the noises O. 

Note now that K e is an e-approximation of the kernel K which has the same singular be¬ 
haviour as the heat kernel. In terms of the parabolic distance, the singularity of the heat ker¬ 
nel scales like K{z) ~ |-|~ 3 for z ^ 0. (Recall that we consider the parabolic distance 
\{t,x)\ = y/\t\ + | ,t|, so that this is consistent with the fact that the heat kernel is bounded by 
£- 3/2 ) -phis suggests that one has K*(z) ~ \z\~ 6 for \z\ 3> e. Since parabolic space-time has 
scaling dimension 5 (time counts double!), this is a non-integrable singularity. As a matter of fact, 
there is a whole power of 2 missing to make it borderline integrable, which correctly suggests that 
one has 



£ 


This already shows that one should not expect II e V to converge to a limit as e —>• 0. Indeed, as 
we will see presently, it turns out that the first term in (14.241) converges to a distribution-valued 
stationary space-time process, so that one would like to somehow get rid of this diverging constant 
C e . This is exactly where the renormalisation map M £ (in particular the factor exp(— C^L{)) 
enters into play. Following the above definitions, we see that one has 

(n £ v)(z) = (n e mv)(z) = (n £ v)(z) - c[ £) . 
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This suggests that if we make the choice C{ £) = C £ , then TTV does indeed converge to a non¬ 
trivial limit as £ A 0. Writing -* for the kernel K, this limit is a distribution which can 

formally be described as 

o o 

n ' v = \/ • 

with the implicit understanding that (n £ V)(A) does not make sense as a random variable, but must 
first be integrated against a smooth test function in order to produce a random variable belonging 
to the second Wiener chaos. Using the scaling properties of the kernel K, it is not too difficult to 
show that this procedure works, and that the resulting random variables satisfy a bound of the type 
considered in Theorem 14. 11 1 with a = — 1. Once we know that IT V converges, it is immediate 
that II XV converges as well, since this amounts to just multiplying a distribution by a smooth 
function. 

A similar argument to what we did for V allows to take care of r = T since one then has 

o ? o /' N q 

(n £ v) 0 ) =''!/' + 3 [ \ : . 

v ’ w V *• / 


Noting that the second term in this expression is nothing but 3C £ (II e l)(z), we see that in this 
case, provided again that C'\ £> = CL, TUT is given by only the first term in the expression above, 
which turns out to converge to a non-degenerate limiting random distribution in a similar way to 
what happened for V, but this time the Kolmogorov-type bound only holds for a = — |. 

Going down our list of terms of negative homogeneity, we see that it remains to consider 'X, 
\9, and <|a It turns out that the last one is the most difficult, so we only discuss that one. The 
explicit expression for IT^ is given in our graphical notation by 


IT ^ 


Q 9 P 


9 p 


q 


p 


O / I O O I o 
/ +6- / +3 \ / + 

\ W w 




p 


/ \ 
I I 

XL 


P 


+ 6 .; / + 6 .; 


'>• 


->•<- 


It turns out that all of these terms, except for the first two, are divergent as e — » 0, so we have 
to hope that our definition of II creates sufficiently many cancellations to take care of them! A 
simple calculation using our definition of M e shows that one has the identity 

n £ ^ = n £ ^ - 3C{ £) n £ ^ - c{ £) n £ T + 3(<p[ £) ) 2 rn - 3d; £) n £ T. 


(Recall that we have set C- = 0 for i p {1,3}.) Inserting the definition of II £ into this expres¬ 
sion and making use of the identity C{ £ = CL with C £ given by (14.261) . we see that many terms 
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do indeed cancel out, finally yielding 


Q ° P 


O 


O 


o 


n'^ = °\j/° + 6 


/° +6 


'>• 


\ ▼ / 


3 Cf } 


O 


(4.27) 


This is still problematic: the penultimate term in this expression contains the kernel 


Qe = •-- >• • (4.28) 

s ✓ 

(This is a slight abuse of our notation: the two variables • are fixed and Q e is evaluated at their 
difference.) As e —>• 0, Q e converges to a kernel Q, which has a non-integrable singularity at the 
origin, thus preventing the corresponding term to converge to a limit. 

This is akin to the problem of making sense of integration against a one-dimensional kernel 
with a singularity of type l/\x\ at the origin. For the sake of the argument, let us consider a 
function W : R —>• R which is compactly supported and smooth everywhere except at the origin, 
where it diverges like W(x) ~ 1 /\x\. It is then natural to associate to W a “renormalised” 
distribution MW given by 


(MW)(<p) = J W(x)(tp(x) — ip( 0 )) dx . 

Note that MW has the property that if tp(0) = 0, then it simply corresponds to integration against 
W, which is the standard way of associating a distribution to a function. In a way, the extra term 
can be interpreted as subtracting a Dirac distribution with an “infinite mass” located at the origin, 
thus cancelling out the divergence of the non-integrable singularity. It is also straightforward to 
verify that if W £ is a sequence of smooth approximations to W (say one has W e (x) = W(x) for 
\x\ > e and W £ ~ 1/e otherwise), then £%W £ —>• MW in a distributional sense, and (using the 
usual correspondence between functions and distributions) one has 

MW e = W £ - C £ S 0 , C £ = [ W £ (x) dx . 


The cure to the problem we are facing for showing the convergence of II £ \^ is virtually identical. 
Indeed, by choosing 


/'AA_ 

°3 — 



(which is a constant as it does not depend on • by translation invariance) the term in the first 
homogeneous Wiener chaos for II ^ is precisely given by 


6 


{MQ e * K £ )(z - Z 2 )£(Z2)dZ2 ■ 
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It turns out that the convergence of 2%Q e to a limiting distribution MQ takes place in a sufficiently 
strong topology to allow to conclude that II ^ does indeed converge to a non-trivial limiting 
random distribution. 

It should be clear from this whole discussion that while the precise values of the constants 
C\ £> and C *3 depend on the shape of the modifier g, the limiting (random) model (fl, f) obtained 
in this way is independent of it. Combining this with the continuity of the solution to the fixed 
point map (14.21) and of the reconstruction operator 71 with respect to the underlying model, the 
conclusion of Theorem ll.ll follows. 

5 Convergence of other smooth models to $3 

In this final section, we give a short overview of the ideas involved in showing that different kinds 
of smooth models (and not just the simplest one obtained by hitting the noise with a smooth 
modifier) also converge to the same process. 

5.1 Non-Gaussian approximations 

We first discuss how the argument sketched in the previous section can be modified to deal with 
the case when £ e does no longer denote some simple modification of £, but instead denotes a 
suitable rescaling of an arbitrary stationary space-time stochastic process, i.e. one has 

£e(t,x) = £~^r](t/e 2 ,x/£) , 

for a smooth stationary process ijonRxRl As already mentioned in the introduction, it is then 
possible to find choices of constants C® such that solutions to 

d t $> £ = a$ £ + ci V) + cf $ £ - + &, 

converge as £ —>• 0 to the process <t> built in Theorem 11.11 In view of Proposition 14.91 this is 
again a consequence of the fact that the renormalised models IT built from as in Section 14.61 
converge to the same limit as before, provided that the renormalisation constants C ( f^ are suitably 
chosen. The purpose of this section is to show why ad of the constants Cf'* with i e {1,... , 5} 
should in general be set to non-zero values. 

We essentially follow the arguments of IIHS151 and we will completely ignore issues related 
to the fact that one wishes to consider the limiting equation on a torus rather than the whole space. 
We assume that the rx-fields generated by point evaluations of rj in any two regions separated by a 
distance of at least 1 are independent. The kind of example for rj one should have in mind is 

Viz ) = J ip{z - z) P fiidz, dip) , 

where P /( denotes a compensated Poisson point process on R 1 x C q° with intensity measure 
p(dz, dip) = dz v{dip), where v is a finite measure charging only functions with support contained 
in the unit bad and satisfying suitable moment conditions to guarantee that 77 admits moments of 
ad orders. 
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Since is no longer Gaussian, we no longer have Wiener chaos decomposition at our dis¬ 
posal. However, there exists an analogue to the Wick product for arbitrary collections of random 
variables, and this will be sufficient for our purpose. In order to introduce the Wick product, we 
first need to introduce joint cumulants. Given a collection of random variables X = {X a } aG A 
for some index set A, and a subset B C A, we write Xb C X and X B as shorthands for 

X B = {X a : aeB}, X B = \\ X a . 

aeB 

Given a finite set B, we furthermore write V( B) for the collection of all partitions of B, i.e. if 
B is non-empty, then V(B) consists of all sets 7r C &(B) (the set of subsets of B) such that 
(J 7 T = B, 0 € it, and such that any two distinct elements of 7r are disjoint. With this definition, 
we in particular have V(0) = {{0}}- The joint cumulant E c (Xg) for a collection X B of random 
variables is then given by the following defining property. 

Definition 5.1 Given a collection X of random variables as above and any non-empty finite set 
B C A, one has 

E(X b )= £ l\E c (X B ). (5.1) 

7T GV(B) BGn 

Note that this definition naturally enforces the convention that E C (W/,) = 1. We can now 
define the Wick product of an arbitrary finite collection of random variables: 

Definition 5.2 The Wick product :X B : for B C A is defined recursively by 

= Y, ■■ X B■ E n E ^ X b) • (5-2) 

BcA ir£V(A\B) BGtt 

Similarly to above, this enforces the natural convention that E:X= 1. In other words, to 
turn a product of random variables into a sum of Wick products, one now not only considers all 
possible ways of replacing pairs of random variables by their covariance, but all possible ways of 
replacing subsets of random variables by their joint cumulants. 

Denote now by 4 £) the pth joint cumulant of £ e , namely 

Kp\zi, ...,Zp) = E c ({£ e (2i), . . . ,£e(Zp)}) ■ 

By translation invariance, this function depends only on the differences between its p arguments. 
Furthermore, it satisfies the scaling reation 

Kp\zi, ■■■,Zp) = £~^ KpiSeZ!, S £ Z p ) , 

where k p denotes the pth joint cumulant of r / and the scaling operator S £ acts on R 4 as S £ (t, x) = 
(f/e 2 ,x / e). We then use the same graphical notation as in IIHS151 for these cumulants: a red 
p-gon for instance |! ”, for p = 4, represents the cumulant function K ( p } (z\ ,..., z p ), with z t given 
by the p integration variables represented as before by the p black dots. This time furthermore, O 
denotes a Wick factor of £ e instead of £. 
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As a consequence of Definitions 15. II and 15.21 we then have for example the identity 


O 


o 


(n e v)(U = 


o 


+ 3 



A powercounting argument shows that this additional term also diverges as e —>• 0, which suggests 
that in this case we should no longer set = 0 as in the Gaussian case but instead we should 
choose the relevant renormalisation constants as follows: 


C[ £) = 




(5.3) 


This choice of renormalisation constant then yields just as in the Gaussian case 

o 0 o 

(n e ^)(z)= \|/ , 

which can then be shown to converge to a finite limiting distribution as e —>• 0. 

Remark 5.3 The fact that fl'7 is tight in some space of distributions as e —>• 0 can be shown 
quite easily by first testing against a test function and then bounding its second moment. Tightness 
in the correct topology and identification of the correct limit is more delicate and requires control 
on moments of all orders. Unfortunately, these can no longer be obtained as a corollary of a 
second moment bound since the analogue to (14.221) does not hold anymore in general. 

We will henceforth ignore these difficulties and admit without proof the fact that if / is an L 2 
kernel of k variables and f e is a sequence of “sufficiently nice” kernels converging to /, then the 
sequence of random variables 

J fe (Zl, ...,z k ) :£e(Zl) • • • ie(ZkY.dz\ ■ ■ ■ dz k , 

converges weakly to I k (f) as e -) 0. This convergence furthermore holds jointly for any finite 
collection of such random variables. See IIHS 151 for more details. 


The necessity of the remaining renormalisation constants can again be traced back to the 
analysis of II \Ja This time, one has the much lengthier identity 













37 


Convergence of other smooth models to <J>| 



At this stage, it starts to become clear that on should in principle start to use a more systematic 
approach, but for the sake of this article, and in order to give a feeling for the kind of bounds 
involved in the proofs, our more “brutal” approach is sufficient. 

In order to bound II £ \£ we first note that as a consequence of the fact that I\ integrates to 
0 , both the last term on the first line and the first term on the third line of the above expression 
vanish. This time, taking into account the fact that all of the C { f ] can be chosen to be non-zero, 
we obtain the identity 

= ir^ - 3Cf : 'n £ ^ - c[ £) ir? + 3(cf ) ) 2 n e, i - 3^ £, n s i - c[ £) n e i - 2cf ) n e T. 


We claim that besides (I5.3I ). the correct choice of renormalisation constants in order to obtain the 
same limiting model as in Section l4~6l is given by 



The four terms appearing on the first line are direct analogues of the terms appearing in the 
Gaussian case (14.27b . In view of Remark 15.31 it is therefore at least very plausible that they 
converge to the same limit. It remains to argue that the terms appearing on the last line vanish in 
the limit e —>• 0. For this, the following simple lemma (see I HS 15„ Lem. 4.7] for a slightly more 
general version) is useful: 
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Lemma 5.4 One has the bound 


][[ | K(zi - zdW^Xzx, 


2—1 


r, Tl 

■ , z n ) I dzi--- dz n < e 5 ( n /2-i) / ^ (\ z . _ ~| 4- e ) -3 , 

•^ r4 i=i 


uniformly over z\,..., z n € R 4 and e > 0 . 

Recall that since we are in space dimension 3, both the heat kernel and its truncation K 
satisfy the bounds |/\(z)| < |-| -3 , so that each factor appearing on the right hand side of this 
bound behaves essentially like a factor K e . In particular, writing 



with the same conventions as in (14.281 ). it follows that the kernels Q® satisfy the bounds 




< 


e K \z 


_i_ K 
2 K 


\Q?\z)\ 


< 


£ K \Z 


_2_ k 
2 K 


\Q ( e\z)\ < e K (\z\+£) 


—5— K, 


provided that one chooses k > 0 sufficiently small. In particular, both Q { } ] and (f F l] converge to 0 
in L p for p sufficiently small (but still greater than I), which allows to show that the first two terms 
on the second line of (15.41) vanish in the limit. To show that the last two terms also vanish in the 
limit, one needs to exploit cancellations between them: taken separately they converge to finite 
non-vanishing limits. However, since Q ® is integrable with its integral remaining uniformly 
bounded as e —>• 0, it is easy to show that CffH-) — converges weakly to 0, where C ( f. 2 

denotes the second term appearing in the above definition of C 2 } . While this in itself is not 
sufficient to guarantee that the sum of the last two terms in (15.41) indeed vanishes in the limit, it is 
a strong indication that it does, and is in fact not very difficult to show. 

In a similar way, it is possible to show that II r converges to the same limit as in Section l4~6l 
for every symbol r G W. With some additional effort, one can then show that this convergence 
actually takes place in the topology of the space of models for (T, Q) which, when combined 
with the continuity statement of Theorem 14.21 the identification of the renormalised solutions 
of Proposition 14.91 and the explicit expressions of the renormalisation constants, immediately 
implies that the solutions to (11.61 ) with constants as in (11.71 ) do indeed converge to the same 
limiting object as constructed in Theorem ll.il 


5.2 Non-cubic approximations 

In this last section, we give a very short sketch of the main ideas appearing in the proof of The¬ 
orem [T3] Recall that we are interested in the study of (11.31) as the effective potential (V$) under¬ 
goes a pitchfork bifurcation. In order to concentrate on the main ideas, we restrict ourselves to 
the case where V is of degree 6 . Then, normalising the equation such that dgd^,(Vg)(0) = —\ 
and (Vo) ( 4 ) (0) = 6 , the derivative of the effective potential (Vg) is given to lowest order in 6 by 

= 8 $- $ 3 - a $ 5 , 


(5.5) 
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for some constant a. From now on we neglect terms of higher order in 9 and we consider the case 
of a potential with derivative equal to (15.51 ). 

Note that (15.51) is the derivative of the effective potential. After rescaling, the model (11.31) we 
consider is therefore given by 

dt$ e ,o = - H 3 ($, C/e) - aeHff®, C/e) + (5.6) 

= A$ £)(9 + e~\9 + 3 C- 15aC 2 )<F - (1 + 10aC)<f> 3 - aeT> 5 + , 

where C is the constant appearing in (11.41) . In this equation, H n {x, c) denotes the nth Hermite 
polynomial with variance c, i.e. Hffx, c) = x 3 — 3ex, etc. A special case of Theorem 1 1.5l is then 
given by, the following 

Theorem 5.5 There exist values a > 0 and b £ R, such that if one sets 9 = ae log e + be, then 
the solution to (15.61) converges as e —>• 0 to the process <f> built in Theorem 17.71 

The remainder of this section is devoted to a sketch of the proof of Theorem l5.51 It is not clear 
a priori how to tit (15.61 ) into the framework we developed in this article. One might think that one 
could try to change (14.21 ) into something like 

$ = VI f>0 (H + ci$ - c 2 <f > 3 - ae<P 5 ) + PT * 0 , (5.7) 

for suitable choices of constants Cj, but one then immediately runs into two (related) problems: 

• In order to interpret (15.71 ) as an equation in V 7 for some regularity structure, we would 
naturally have to replace the recursive step (14.41) by something like 

n G U = 4 > 24T1T2T3T4T5) € U . 

The problem then is that the resulting collection of symbols no longer has the property 
that, for every 7 € R, there are only finitely many symbols of homogeneity at most 7. 
This pretty much destroys the rest of the argument. 

• The whole point of our construction was to build a solution map that no longer depends on 
e and to hide all the singular e-dependency of the solution in the convergence of a suitable 
sequence of “models” to a limit. In the case of (15.71) . we still have an e-dependent fixed 
point problem. This is of course not a problem if its solution depends continuously on e 
as e —>• 0, but this is precisely not expected here: Theorem 1 1.5 1 states that the effect of the 
quintic term is still felt in the limit since the constant appearing in front of the cubic term 
in the limit is given by the cubic term of the effective potential (Vo) and not by that of Vo- 

In order to circumvent these problems, the idea introduced in | |HQ15| is to build a regularity 
structure which encodes the operation “multiplication by e” as a non-trivial operation in V 7 , thus 
allowing us again to set up an e-independent fixed point problem and to shift all of the difficulties 
to that of proving that a suitable model converges to a limit. 

For this, besides the symbols X, H and the operator X used before, we introduce a new opera¬ 
tor £ and we consider instead of (15.71) a fixed point problem of the type 


- $ 3 - a£4> 5 ) + 0 , 


(5.8) 
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where £ will be an operator acting on V 7 built from £ (in a way reminiscent of how V and JC 
were built from X). The recursive step (14.41 ) in the construction of the regularity structure is then 
replaced by 

n € U => X(TiT2T 3 ) e U h X(£(TiT2 T3T4T 5 )) G U . 
and the definition (14.51 ) of the collection of symbols W is replaced by 

W = {£} U {1TT2T3 : Ti € 11 } U {£{tit 2 t 3 t 4 t 5 ) : t* € U) . 

We also define a slightly larger collection of symbols 

Wex = W U {tiT 2 T 3 T 4 T 5 : Tj € ZY} , 

and we call sini dai ly to before T and T ex the linear spans of W and £ ex respectively. In this 
construction, we consider the symbol £ as an “integration operator of order 1” in the sense that 
we set |£(t)| = |r| + 1. This might seem strange at first sight: after all £ is supposed to represent 
multiplication by the number e, while the homogeneity is supposed to be associated to some kind 
of regularity, but we will see that this is actually quite natural. 

The structure group Q associated to the two graded spaces T and T ex is then built in an anal¬ 
ogous way to the construction of Section 14.21 The difference is that elements of T + are now not 
just polynomials in X and J^(t) as in (14.71) . but are also allowed to contain factors X)(t), with 
the maps A and A + defined as above, but with the additional rule 

_ 

A £{t) = {£ <g> id)AT + — <g) <4(t) , 

k\ 

k 

and si mil aiiy for A 1 . Given a smooth space-time function £, we now have & family of canonical 
lifts 22? £ (Q = (II f£) ,r (£ >) of £ to models on .% x given as before by (14.61 ) and (13.111) . together with 

(Jlf£{r)){z) = e{HfT)(z)-e £ (5.9) 

|fc|<M+i 

The operators arc also built in exactly the same way as in Section 14.21 with the additional 
definition 

/i e V<r) = -£ ( -^(D k+e U^T)(z) , (5.10) 

|fc+r|<|r|+l 

which in particular allows us to rewrite (15.91) as 

Iif£{r) = ertfV + Y, U( * ) (X fzK&T) • (5.11) 

|fc|<|r|+t 

These definitions guarantee that if we define IlW as in Section l4~2l one has 

n (£) X(r) = £ n (E) T , (5.12) 


and it is in this sense that £ represents multiplication by e. 
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Remark 5.6 These definitions deviate slightly from those given in |HQ 15| . They are however 
equivalent and amount to a simple change of basis in T + . The reason for our choice here is that 
it leads as in (14.141 ) to the nicer expression 


X k 


A + St{r) = (Si ® id)Ar + ^ <g> Si +k (r) , 


for A + , where we used similarly to before the convention that Stir) = 0 unless |r| + 1 > \t\. 


Remark 5.7 At this stage, we would like to point out one fundamental difference between £ and 
X. In the case of X, the fact that it represents K in the sense of (13.1 II) was made part of the 
definition of our notion of admissible model, and we only ever considered renormalisation proce¬ 
dures that preserve the set of admissible models. In the case of £ however, we only impose (15.101) 
and (15.121) for the canonical lifts 2 z? e (£). In general, we do not impose that these identities are 
satisfied for an arbitrary admissible model. In particular', it is crucial to allow for renormalisation 
procedures that break them. This is reminiscent of the status of the product which is typically not 
preserved by renormalisation. 

This construction also suggests a definition for the operator £\ given an admissible model 
(II, /), we set 

(£U)(z) = £U(z) - ]T (X + Z) f z (Si(U(z))) . 

i 

Combining this with (15.1 II ). it follows immediately that for a canonical model of the type 2 z? e (£), 
one has the identity 

TZ£U = eRU . 


This however is a particular property of such canonical models. In general, there is no reason for 
1Z£U to be proportional to 1ZU\ What is more, the operator £ has a regularising property in the 
X 7 -scale of spaces. More precisely, we have the following result, the proof of which is given in 
[HQ15 Prop. 3.15], 


Proposition 5.8 For the regularity structure (T cx . G) just constructed, there exists 5 > 0 such 
that the operator £ is bounded from X 7 to X (7+1 ) A<5 . 

In particular, combining this with Theorem 13.171 it turns out that if we set |H| = — | — k for 
k > 0 small enough, then the map 4? V£ ( l > ~ > is bounded from X 7 into X 7 as soon as 7 > 1 +4k. 

Similarly to before one can show that, for any admissible model and for sufficiently regular initial 
conditions, the fixed point problem (15.81 ) admits short-time solutions, with the existence time of 
these solutions uniformly bounded over bounded balls in the space of admissible models, and 
solutions being continuous not just as a function of their initial condition but also as a function 
of the model. Furthermore, for models of the type .%U£) (with £ continuous), these solutions are 
such that ip = solves 


dtp = Ap - p 3 - asp 5 + £ , p( 0 , •) = $0 • 
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Remark 5.9 One crucial remark at this stage is that solutions to (15.81 ) do not depend on the 
full model on (T ex , Q), but only on its restriction to the smaller regularity structure (T. Q). In 
particular, these solutions make sense for any admissible model on (T, Q), even if such a model 
does not come from a model on (T ex , G)l 

As before, one cannot expect that ,iG(£ s ) converges to a limit if £ e is a mollification of white 
noise. However, it is possible to perform a renormalisation such that the models M e Jz? e (£ e ) con¬ 
verge to a limit on the smaller structure (T, Q). Even after renormalisation, these models do not 
converge on (T ex , Q), but thanks to Remark 15^91 this does not matter for us. Let us now describe 
the renormalisation maps M suitable in this setting. We keep the same graphical notation for 
basis vectors of T /T ex as previously, with £ depicted by a circle. With this notation, one has for 
example 

£CT(H) 5 ) = , 1{E) 2 1(£(1(E) 5 )) = 455. ■ 

Since we only work with Gaussian approximations, we only keep L\ and L 3 from Section 1431 
which we rename L\ and L 2 to avoid confusion. The operator L\ is extended to all of T ex in 
the same way as before by furthermore postulating that it commutes with £, so that one has for 
example 

Li<&= Qy, L^= + 

In the case of L 2 , we extend it to T ex by simply postulating that it vanishes on all symbols except 
for and \£, so we do for example set = 0 rather than 1 * 2 $ = £(1). Finally, we add two 
more operators L 3 and L 4 given by 

L$$ = l, L 3 ^ = 4T, L 4 #* = l, L 4 -# = 51, (5.13) 

as well as L,t = 0 for all combinations of i £ {3,4} and r E W not appearing in (15.131) . 
The rationale here is that L 3 “contracts” all instances of '0 , which appears 4 times in -'0 and 
nowhere else, and similarly for Z 4 . Similarly to before, we then consider renormalisation maps 
of the form M = exp(— JT Gjl/j). There is an analogue in this context to Proposition 14.71 which 
shows that these renormalisation maps arc “legal” in the sense that they induce an action on 
admissible models with the property that a model represented by II is mapped to one represented 
by n M. 

By a calculation analogous to that of Proposition 14. 91 one can then show the following. 

Proposition 5.10 Let M be as above and let (II M ,r M ) be the model obtained by acting with 
M on (n, T) = J 2 ? e (Q for some smooth function £ and some £ > 0. Let furthermore 'I* be the 
solution to ( 15. SI) with respect to the model (n A/ , T AI ). Then, the function u(t , x) = x ) 

solves the equation 

dtu = A u — H^lu, Ci) — aeH^lu, C\) — ( 9(72 + 20a(7 3 + 2ba?Cf)u + £ , 
where H n denotes the nth Hermite polynomial as before. 
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Given c € R and e > 0, we now define M C)£ = exp(— JA C { f'" 1 L ,) by setting 



with the same graphical notations as in the previous section. Here, C\ diverges like e~ l and 
C '2 diverges logarithmically, but the remaining two constants actually converge to finite limits as 
e —» 0. The reason for these definitions is the following result, which is the crucial point on which 
the proof of Theorem [53] hinges. 

Theorem 5.11 Let £ £ = g e * L, for some space-time white noise £ and, for c G R and e > 0, let 

m C;S = Mc^ceife) , 

with M re as above. Then, the limit lim £ _>o 2J1 C£ exists in the space of admissible models on (T, Q) 
and is independent of c. 

Remark 5.12 It is not true that the models 9Jl Cj£ converge to a limit on the extended regularity 
structure (T ex , Q). Fortunately, this is not needed for our argument. 

The proof of this result follows the same line of argument as in Section l4~6l The reason why 
the limit turns out to be independent of c is that the value of c only affects I1 z t for symbols r 
containing at least one instance of £. Our choice of renormalisation constants is then precisely 
such that, in the limit e —» 0 , all of these terms vanish. 

Combining Theorem 15.111 with Proposition 15.101 and the fact that solutions to (15.81 ) depend 
continuously on the underlying model, we conclude that the solutions to 

d t u = A u- H 3 (u, Cf } ) - aeH 5 (u , C[ e) ) - (9 Cf + 20 aCf + 25 a?Cf)u + £ £ , 

as well as those to 

dtu = A u — u 3 + (3(7{ £) — 9C < 2 £) )« + £ e , 

both converge to the same limit as e —» 0. Since the limit of the latter is the model by definition, 

the claim of Theorem 15.51 follows at once. 
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